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Preface 


This text is designed for a two-semester course in data Structures and programming. 
For several years, we have taught a course in data structures to students who have had a 
semester course in high-level language.programming and a semester course in assembly 
language programming. We found that a considerable amount of time was spent in 
teaching programming techniques because the students had not had sufticient exposure 
to programming and were unable to implement abstract structures on their own. The 
brighter students eventually caught on to what was being done. The weaker students 
never did. Based on this experience, we have reached the firm conviction that a first 
course in data structures must go hand in hand with a second course in programming. 
This text is a product of that conviction. 

The text introduces abstract concepts, shows how those concepts are useful in 
problem solving, and then shows how the abstractions can be made concrete by us- 
ing a programming language. Equal emphasis is placed on both the abstract and the 
concrete versions of a concept, so that the student learns about the concept itself, its 
implementation, and its application. 

The languages used in this text are C and C++. Cis well suited to such a course 
since it contains the control structures necessary to make programs readable and allows 
basic data structures such as stacks, linked lists, and trees to be implemented in a variety 
of ways. This allows the student to appreciate the choices and tradeoffs which face a 
programmer in a real situation. C is also widespread on many different computers and 


it continues to grow in popularity. As Kernighan and Ritchie indicate, C is “a pleasant, 
expressive, and versatile language.” 


xiii 


We have included information on C+ + in the early chapters, introducing the fea- 
tures of C+ + and showing how they can be used in implementing data structures. No 
specific background in C++ is needed. Classes in C++ are introduced in a new Sec- 
tion 1.4. This section discusses classes. including function members. It also introduces 
inheritance and object orientation. The section includes an exumple of implementing 
abstract data types in C++, as well as polymorphism. To Section 2.3 we have added an 
implementation of stacks in C+ + using templates. This shows how complex data struc- 
tures can be parameterized for different base types. A new Section 4.6 has been added, 
showing how linked lists can be implemented in C+ +. Such an implementation shows 
the limitations, as well as the power, of encapsulation in implementing data structures. 
The peint should be made that encapsulated data structures must be designed carefully 
to allow users to do what they need in a data structure. Also discussed in this context 
are C++ dynamic allocation and freeing of storage. 

The only prerequisite for students using this text is a one-semester course in pro- 
gramming. Students who have had a course in programming using such languages as 
FORTRAN, Pascal. or PL/I can use this text together with one of the elementary C or 
C++ texts listed in the Bibliography. Chapter | also provides information necessary 
for such students to acquaint themselves with C. 

Chapter 1 is an introduction to data structures. Section 1.1 introduces the concept 
of an abstract data structure and the concept of an implementation. Sections 1.2 and 
1.3 introduce arrays and structures in C. The implementations of these two data struc- 
tures as well as their applications are covered. Chapter 2 discusses stacks and their 
C implementation. Since this is the first new data structure introduced. considerable 
discussion of the pitfalls of implementing such a structure is included. Section 2.3 in- 
troduces postfix, prefix, and infix notations. Chapter 3 covers recursion, its application, 
and its implementation. Chapter 4 introduces queues, priority queues, and linked lists 
and their implementations both using an array of available nodes as well as using dy- 
namic storage. Chapter 5 discusses trees, Chapter 6 introduces O notation and covers 
sorting, while Chapter 7 covers both internal and external searching. Chapter 8 intro- 
duces graphs, and Chapter 9 discusses storage management. At the end of the text, we 
have included a large Bibliography with each entry classified by the appropriate chapter 
or section of the text. 

A one-semester course in data structures consists of Section 1.1, Chapters 2 
through 7, and Sections 8.1, 8.2, and part of Section 8.4. Parts of Chapters 3, 6, 7, and 
8 can be omitted if time is pressing. 

This text is suitable for courses based upon the Algorithms and Data Structures 
knowledge unit (AL 1-6, 8) as well as sections of the Programming Languages knowl- 
edge unit (PL 3-6, 10, 11) as described in the report Computing Curricula 1991 of the 
ACM/IEEE-CS Joint Curriculum Task Force. It follows closely the sample Data Struc- 
tures and Analysis of Algorithms course presented in the report and may be used in 
second- and third-tier classes of a typical computer science curriculum for both majors 
and nonmajors. 

The text is suitable for course C82 and parts of courses C87 and C813 of Cur- 
riculum 78 (Communications of the ACM, March 1979), courses UC] and UCS of 
the Undergraduate Programs in Information Systems (Communications of the ACM. 
December 1973) and course 11 of Curriculum 68 (Communications of the ACM. March 
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1968). In particular, the text covers parts or all of topics P1, P2, P3, P4, P5. S2. DI, D2. 
D3, and D6 of Curriculum 78. ; 

Algorithms are presented as intermediaries between English language descrip- 
tions and C programs. They are written in C style interspersed with English. These 
algorithms allow the reader to focus on the method’used to solve a problem without 
concern about declaration of variables and the peculiarities of real language. In trans- 
forming an algorithm into a program, we introduce these issues and peint out the pitfalls 
that accompany them. 

The indentation pattern used for programs and algorithms is based loosely on a 
format suggested by Kernighan and Ritchie (The C Programming Language, Prentice 
Hall, 1978) which we have found to be quite useful. We have also adopted the con- 
vention of indicating in comments the construct being terminated by each mae 
a closing brace (}). Together with the indentation pattern, this is a valuable tool in im- 
proving program comprehensibility. We distinguish between algorithms and programs 
by presenting the former in italics and the latter in roman. 

Most of the concepts in the text are illustrated by several examples. Some of 
these examples are important topics in their own right (e.g., postfix notation, multiword _ 
arithmetic, etc.) and may be treated as such. Other examples illustrate different im- 
plementation techniques ( such as sequential storage of trees). The instructor is free to 
cover as many or as few of these examples as he or she wishes. Examples may also 
be assigned to students as independent reading. It is anticipated that an instructor will 
be unable to cover all the examples in sufficient detail within the-confines of a one- or 
two-semester course. We feel that at the stage of a student's development for which the 
text is designed, it is more important to cover several examples in great detail than to 
cover a broad range of topics cursorily. 

All the programs and algorithms in this text have been tested and debugged. We 
wish to thank Miriam Binder and Irene LaClaustra for their invaluable assistance in this 
task. Their zeal for the task was above and beyond the call of duty and their suggestions 
were always valuable. Of course, any errors that remain are the sole responsibility of 
the authors. 

The exercises vary widely in type and difficulty. Some are drill exercises to en- 
sure comprehension of topics in the text. Others involve modifications of programs or 
algorithms presented in the text. Still others introduce new concepts and are quite chal- 
lenging. Often, a group of successive exercises includes the complete development of 
a new topic which can be used as the basis for a term project or an additional lecture. 
The instructor should use caution in assigning exercises so that an assignment is suit- 
able to the student’s level. We consider it imperative for students to be assigned several 
(from five to twelve, depending on difficulty) programming projects per semester. The 
exercises contain several projects of this type. J 

We have attempted to use the C language, as specified in the second edition of 
the Kernighan and Ritchie text. This corresponds to the C ANSI Standard., Programs 
given in this book have all been developed using Borland C++ but have only made 
use of features as described in the evolving ANS] C++ draft standard. They should 
run without change on a wide variety of C++ compilers. See the reference manual for 
your particular system or consult the “Working Paper for Draft Proposed International 
Standard for Information System—Programming Language C++.” available from the 
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American National Standards Institute (ANS!) Standards Sccretariat: CBEMA, 1250 
Eye Street NW, Suite 200, Washington, DC 20005. You should, of course, warn your 
students about any idiosyncrasies of the particular compiler they are using. We have 
also added some references to several personal computer C and C++ compilers. 

Miriam Binder and Irene LaClaustra spent many hours typing and correcting the 
original manuscript as well as managing a large team of students whom we mention 
below. Their cooperation and patience as we continually made up and changed our 
minds about additions and deletions are most sincerely appreciated. 

We would like to thank Shaindel Zundel-Margulis, Cynthia Richman, Gittie 
Rosenfeld-Wertenteil, Mindy Rosman-Schreiber, Nina Silverman, Helene Turry, and 
Devorah Sadowsky-Weinschneider for their invaluable assistance. 

Vivienne Esther Langsam and Tziyonah Miriam Langsam spent many hours re- 
vising the index for the second edition of this book. We would like to thank them for 
helping us complete the book in the face of a fast approaching deadline. 

The staff of the City University Computer Center deserves special mention. They 
were extremely helpful in-assisting us in using the excellent facilities of the center. The 
same can be said of the staff of the Brooklyn College Computer Center, 

We would like to thank the editors and staff at Prentice Hall and especially the 
reviewers for their helpful comments and suggestions. 

Finally, we thank our wives, Vivienne Esther Langsam, Gail Augenstein, and 
Miriam Tenenbaum, for their advice and encouragement during the long and arduous 
task of producing such a book. 

YEDIDYAH LANGSAM 
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AARON M. TENENBAUM 


—_—_—_—_— 


To my wife, Vivienne Esther 
YL 


To my wife, Gail 
MA 


To my wife, Miriam 
AT 


qoOoqOqWqoooqOqmqmqaa— 


Preface 


xvi 


Introduction to Data 


Structures 


I 


A computer is a machine that manipulates information. The study of computer science 
includes the study of how information is organized i in a computer, how it can be ma- 
nipulated, and how. it can be utilized, Thus, it is exceedingiy important for a student of 
computer science to understand the concepts of information organization and manipu- 
lation in order to continue study of the field. 


1.1 INFORMATION AND MEANING 


if computer science is fundamentally the study of information, the first question that 

arises is, what is information? Unfortunately, although the concept of information is the 

bedrock of the entire field, this question cannot be answered precisely. In this sense the . 
concept of information in computer science is similar to the concepts of point, line, and 

plane in geometry: they are all undefined terms about which statements can be made 

but which cannot be explained in terms of more elementary concepts. 

In geometry it is possible to talk about the length of a line despite the fact that 
the concept of a line is itself undefined. The length of a line is a measure of quantity. 
Similarly, in computer science we can measure quantities of information. The basic 
unit of information is the bit, whose value asserts one of two mutually exclusive pos- 
sibilities. For example, if a light switch can be in one of two positions but not in both 

| Simultaneously, the fact that it is either in the “on™ position or the “off position is one 
bit of information. If a device can be in more than two possible states, the fact that it 
is ina particular state is more than one bit of information. For example, if a dial has 


1 


eight possible positions, the fact that it is in position 4 rules out seven other possibilities, 
whereas the fact that a light switch is on rules out only one other possibility. 

Another way of thinking of this phenomenon is as follows. Suppose that we had 
only two-way switches but could use as many of them as we needed. How many such 
switches would be necessary to represent a dial with eight positions? Clearly, one switch 
can represent only two positions (see Figure 1.1.1a). Two switches can represent four 
different positions (Figure 1.1.1b), and three switches are required to represent eight 
different positions (Figure 1.1.1c). In general, n switches can represent 2” different 
possibilities. ‘ 

The binary digits 0 and 1 are used to represent tho two possibie states of a par- 
ticular bit (in fact, the word “bit” is a -ontraction of the words “binary digit”), Given n 
bits, a string of n Is and Os is used to represent their settings. For example, the string 
101011 represents six switches, the first of which is “on” (1), the second of which is 
“off” (0), the third on, the fourth off, and the fifth and sixth on. 

We have seen that three bits are sufficient to represent eight possibilities. The 
eight possible configurations of these three bits (000, 001, 010, 01!, 100, 101, 110, 
and 111) can be used to represent the integers 0 through 7. However, there is nothing 
about these bit settings that intrinsically implies that a particular setting represents a 
particular integer. Any assignment of integer values to bit settings is valid as long as 
no two integers are assigned to the same bit setting. Once such an assignment has been 
made, a particular bit setting can be unambiguously interpreted as a specific integer. 
Let us examine severai widely used methods for interpreting bit settings as integers. 


Binary and Decimal Integers 


The most widely used method for interpreting bit settings as nonnegative in: 

is the binary number system. In this system each bit position represents a power of 2. 
The rightmost bit position represents 2° which equals 1, the next position to the left 
represents 2! which is 2, the next bit position represents 2? which is 4, and so on. An 
integer is represented as a sum of powers of 2. A string of all Os represents the number 0. 
If a l appears in a particular bit position; the power of 2 represented by that bit position 
is included in the sum; but if a 0 appears, that power of 2 is not included in the sum. For 
example, the group of bits 00100110 has 1s in positions 1, 2, and 5 (counting from right 
to left with the rightmost position counted as position 0). Thus 00100110 represents ti: 
integer 2! + 2? + 2° = 2 + 4 + 32 = 38. Under this interpretation, any string of bits 
of length n represents a unique nonnegative integer between 0 and 2” — 1, and any 
nonnegative integer between 0 and 2” — 1 can be represented by a unique string of bits 
of length n. 

“There are two widely used methods for representing negative binary numbers. In 
the first method, called ones complement notation, a negative number is represented 
by changing each bit in its absolute value to the opposite bit setting. For example. 
since 00100110 represents 38, 11011001 is used to represent —38. This means that the 
leftmosf bit of a number is no longer used to represent a power of 2 but is reserved 
for the sign of the number. A bit string starting with a 0 represents a positive number. 
whereas a bit string starting with a 1 represents a negative number. Given n bits. the 
range of numbers that can be represented is —2""" + 1 (a 1 followed by 7 — | zeros) to 
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(a) One switch (two possibilities). 


Switch 1 Switch 2 


ETA 


(b) Two switches (four possibilities). 
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(c) Three switches (eight possibilities). 


Figure 1.1.1 


Information and Meaning 


2'"-) — 1 (a 0 followed by n— 1 ones). Note that under this representation, there are two 
representations for the number 0: a “positive” 0 consisting of all Os, and a “negative” 0 
consisting of all Is. 

The second method of represei. -g negative binary numbers is called twos com- 
plement notation. In this notation, 1 is added to the ones complement representation of 
a negative number. For example, since 11011001 represents —38 in ones complement 
notation, 11011010 is used to represent —38 in twos complement notation. Given n bits, 
the range of numbers that can be represented is —2""~) (a 1 followed by n — J zeros) 
to 207D — 1 (a 0 followed by n — 1 ones). Note that —2""~") can be represented in twos 
complement notation but not in ones complement notation. However, its absolute value 
2'"~" cannot be represented in either notation using n bits. Note also that there is only 
one representation for the number 0. using n bits in twos complement notation. To see 
this, consider 0 using eight bits: 00000000. The ones complement is 11111111, which 
is negative 0 in that notation. Adding | to produce the twos complement form yields 
100000000, which is nine bits long. Since only eight bits are allowed, the leftmost bit 
(or “overflow’’) is discarded, leaving 00000000 as minus 0. 

The. binary number system is by no means the only method by which bits can 
be used to represent integers. For example, a string of bits may be used to represent 
integers in the decimal number system, ‘as follows. Four bits can be used to represent 
a decimal digit between 0 and 9 in the binary notation described previously. A string 
of hits of arbitrary length may be divided into consecutive sets of four bits, with each 
Set fepresenting a decimal digit. The string then represents the number that is formed 
by those decimal digits in conventional decimal notation. For example, in this system 
the bit string 00100110 is separated into two strings of four bits each: 0010 and 0110. 
The first of these represents the decimal digit 2 and the second represents the decimal 
digit 6, so that the entire string represents the integer 26. This representation is called 
binary coded decimal. 

One important feature of the binary coded decimal representation of nonnegative 
integers is that not all bit strings are valid representations of a decimal integer. Four 
bits can be used to represent one of sixteen different possibilities, since there are six- 
teen possible states for a set of four bits. However, in the binary coded decimal integer 
representation, only ten of.those sixteen possibilities are used. That is, codes such as 


1010 and 1100, whose binary values are 10 or larger, are invalid in a binary coded 
decimal number. : 


Real Numbers 


The usual method used by computers to represent real numbers is floating-point 
notation. There are many Varieties of floating-point notation and each has individual 
characteristics. The key concept is that a real number is represented by a number, called 
a mantissa, times a base raised to an integer power, called an exponent. The base is 
usually fixed, and the mantissa and exponent vary to represent different real numbers. 
For example, if the base is fixed at 10, the number 387.53 could be represented as 
38753 X 107+. (Recall that 107 is .01.) The mantissa is 38753, and the exponent is 
—2. Other possible representations are .38753 X 10% and 387.53 X 10°. We choose the 
representation in which the mantissa is an integer with no trailing Os. 
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In the floating-point notation that we describe (which, is not-necessarily im- 
plemented on any particular machine exactly as described), a real number- is tep- 
resented by a 32-bit string consisting of a 24-bit, mantissa followed. by an 8-bit 
exponent. The base is fixed at 10, Both the mantissa.and the exponent are twos, com- 
plement binary integers. For example, the 24-bit. binary. representation-of 38753 is 
00000000100101 1101100001, and the 8-bit twos complement binary representation of 
~2 is 11111110; the representation of 387.53 is 00000000100101110110000111111110. 
Other real numbers and their floating-point representations are as follows; 


$ i bi innit 
100  00000000000000000000000100000010 -+ 
5 000000000000000000000 101111111 > 
000005  00000000000000000000010111111010. 
12000 000000000000000000001 10000000011 
~387.53 11111111011010001001111111111110., 
—12000  11111111111111111111010000000014 : 


, je 


The advantage of floating-point notation is that it can be used torepresent numbers 
with extremely large or extremely small absolute: values. For example, in the notation 
»resented previously, the largest number that can be represenied ig, A231) XAP, 
which is a very large number indeed. The smallest positive. number that can be rep- 
resented is 10~!28, which is quite small. The limiting factor on the precision to which 
numbers can be represented on a particular machine is the number of significant binary 
digits in the mantissa. Not every number between the largest and the smallest can be 
represented. Our representation allows only 23 significant bits. Thus, a number such 
as 10 million and 1, which requires 24 significant binary digits in the mantissa, would 


have to be approximated by 10 million (1 X 10’), which only requires one significant 
digit. 


Character Strings 


As we all know, information is not always interpreted numerically. Items such as 
names, job titles, and addresses must also be represented in some fashion within a com- 
puter. To enable the representation of such nonnumeric objects, still another method of 
interpreting bit strings is necessary. Such information is usually represented in charac- 
ter string form. For example, in some computers, the eight bits 00100110 are used to 
represent the character *&’. A different eight-bit pattern is used to represent the char- 
acter ‘A’, another to represent ‘B’, another to represent ‘C’, and still another for each 
character that has a representation in a particular machine, A Russian machine uses 
bit patterns to represent Russian characters, whereas an Israeli machine uses bit pat- 
terns to represent Hebrew characters. (In fact, the characters being used are transparent 
to the machine: the character set carr be changed by using a different font set on the 
printer.) i 

If eight bits are used to represent a character, up,to 256 different characters can 
be represented, since there are 256 different eight-bit patterns. If the string 11000000 is 
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used to represent the character ‘A’ and 11000001 is used to represent the character ‘B’, 
the character string “AB” would be represented by the bit string 11000000] 1000001. 
In general, a character string (STR? is represented by the concatenation of the bit strings 
that represent the individual charac rs of the string. 

As in the case of integers, there is nothing about a particular bit string that makes it 
intrinsically suitable for representing a specific character. The assignment of bit strings 
to characters may be entirely arbitrary, but it must be adhered to consistently. It may 
be that some convenient rule is used in assigning bit strings to characters. For exam- 
ple, two bit strings may be assigned to two letters so that the one with a smaller binary 
value is assigned to the letter that comes earlier in the alphabet. However, such a rule is 
merely a convenience; it is not mandated by any intrinsic relation between characters 
and bit strings. In fact, computers even differ over the number of bits used to represent 
a character. Some computers use seven bits (and therefore allow only ` >to 128 possible 
characters), some use eight (up to 256 characters), and some useten p to 1024 pos- 
sible characters). The number of bits necessary to represent a character in a particular 
computer is called the byte size and a group of bits of that number is called a byte. 

Note that using eight bits to represent a character means that 256 possible char- 
acters can be represented. It is not very often that one finds a computer that uses so 
many different characters ( although it is conceivable for a computer to include upper- 
and lowercase letters, special characters, italics, boldface, and-other type characters), 
so that many of the eight-bit codes are not used to represent characters. 

Thus we see that iniormation itself has no meaning. Any meaning can be assigned 
to a particular bit pattern, as long as it is done consistently. It is the interpretation of a bit 
pattern that gives it meaning. For example, the bit string 00100110 can be interpreted 
as the number 38 (binary), the number 26 ( binary coded decimal), or the character ‘&’. 
A method of interpreting a bit pattern is often called a data type. We have presented 
several data types: binary integers, binary coded decimal nonnegative integers, real 
numbers, and character strings. The key questions are how to determine what data types 


are available to interpret bit patterns and what data type to use in interpreting a particular 
bit pattern. 


Hardware and Software 


The memory (also called storage or core) of a computer is simply a group of bits 
(switches). At any instant of the computer's operation any particular bit in memory is 
either 0 or 1 (off or on). The setting of a bit is called its value or its contents. 

The bits in a computer memory are grouped together into larger units such as 
bytes. In some computers, several bytes are grouped together into units called words. 
Each’Such unit (byte or word, depending on the machine) is assigned an address, that 
is, a name identifying a particular unit among all the units in memory. This address is 
usually numeric, so that we may speak of byte 746 ór word 937. An address ‘is often 
called a location, and the contents of a location are the values of the bits that make up 
the unit at that location. 

- Every computer has a set of “native” data types. This means that it is constructed 
with a mechanism for manipulating bit patterns consistent with the objects they repre- 
sent. For example, suppose that a computer contains an instruction to add two binary 
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integers and place their sum at a given location in memory for subsequent use. Then 
there must be a mechanism built into the computer to 


1. Extract operand bit patterns from two given locations. 


2. Produce a third bit pattern representing the binary integer that is the sum of the 
two binary integers represented by the two operands. 
3. Store the resultant bit pattern at a given location. 


The computer “knows” to interpret the bit paiterns at the given locations as binary 
integers because the hardware that executes that particular instruction is designed to do 
so. This is akin to a light “knowing” to be on when the switch is in a particular position. 

If the same machine also has an instruction to add two real numbers, there must 
be a separate built-in mechanism to interpret operands as real numbers. Two distinct in- 
structions are necessary for the two operations, and each instruction carries within itself 
an implicit identification of the types of its operands as well as their explicit locations. 
Therefore it is the programmer’s responsibility to know which data type is contained 
in each location that is used. It is the programmer's responsibility to choose between 
using an integer or real addition instruction to obtain the sum of two numbers. 

A high-level programming language aids in this task considerably. For example, 
if a C programmer declares 


int x, y; 
float a, b; 


space is reserved at four locations for four different numbers. These four locations may 
be referenced by the identifiers x, y, a, and b. An identifier is used instead of 2 numer- 
ical address to refer to a particular memory location because of its convenience for the 
programmer. The contents of the locations reserved for x and y will be interpreted as 
integers, whereas the contents of a and b.will be interpreted as floating-pojnt numbers. 


The compiler that is responsible for translating C programs into machine language will 
translate the “+” in the statement 


X=X+#/Y; 
into integer addition, and will translate the “+” in the statement 


a=a+b; 


into floating-point addition. An operator such as “+” is really a generic operator because 
it has several different meanings depending on its context. The compiler relieves the 
programmer of specifying the type of addition that must be performed by examining 
the context and using the appropriate version. 

It is important to recognize the key role played by declarations in a high-level lan- 
guage. It is by means of declarations that the programmer specifies how the contents of 
the computer memory are to be interpreted by the program. In doing this. a declaration 
specifies how much memory is needed for a particular entity, how the contents of that 
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memory are'to be interpreted, and other vital details. Declarations also specify to the 
compiler exactly what is meant by the operation symbols that are subsequently used. 


Concept of Implementation 


Thus far we have been viewing data types as a method of interpreting the memory 
contents of a computer. The set of native data types that a particular computer can 
Support is determined by what functions have been wired into its hardware. However, 
we can view the concept of “data type” from a completely different perspective; not 
in terms of what a computer can do, but in terms of what the user wants done. For 
example, if one wishes to obtain the sum of two integers, one does not care very much 
about the detailed mechanism by which that sum will be obtained. One is interested in 
manipulating the mathematical concept of an “integer,” not in mi sulating hardware 
bits. The hardware of the computer may be used to represent an inicger, and is useful 
only insofar as the representation is successful. i 

Once the concept of “data type” is divorced from the hardware capabilities of 
the computer, a limitless number of data types can be considered. A data type is an 
abstract concept defined by a set of logical properties. Once such an abstract data type 
is defined and the legal operations involving that type are specified, we may implement 
that data type (or a close approximation to it). An implementation may be a hardware 
impleraentation, in which the circuitry necessary to perform the required operations is 
designed and constructed as part of a computer; or it may be a software implementation, 
in which a program consisting of already existing hardware instructions is written to 
interpret bit strings in the desired fashion and to perform the required operations. Thus, a 
software implementation includes a specification of how an object of the new data type 
is represented by objects of previously existing data types, as well as a specification 
of how such an object is manipulated in conformance’ with the operations defined for 
it. Throughout the remainder of this text, the term “implementation” is used to mean 
“software implementation.” 


Example 


We illustrate these concepts with an example. Suppose that the hardware of a 
computer contains an instruction 


MOVE (source, dest, length) 


that copies a character string of length bytes from an address specified by source to an 
address specified by dest. (We present hardware instructions using uppercase letters. 
The length must be specified by an integer, and for that reason we indicate it with 
lowercase letters. source and dest can be specified by identifiers that represent storage 
locations.) An example of this instruction is MOVE(a,b,3), which copies the three bytes 
Starting at location a to the three bytes starting at location b. 

Note the different roles played by the identifiers a and b in this operation. The 
first operand of the MOVE instruction is the contents of the location specified by the 
identifier a. The second operand, however, is not the contents of location b, since these “ 
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contents are irrelevant to the execution of the instruction. Rather, the location itself is 
the operand, since the location Specifies the destination of the character string. Although 
anidentifier always stands for a location, it is common for an identifier to be used to 
reference the contents of that location. It is always apparent frem the context whether 
an identifier is referencing a location or its contents. The identifier appearing as the first 
operand of a MOVE instruction refers to the contents of memory, whereas the identifier 
appearing as the second operand refers to a location. a : 

We also assume the computer bardware to contain the usual arithmetic and 
branching instructions, which we indicate by using C-like notation. For example, the 
instruction 

I H i 
Z= x+y; 


interprets the contents of the bytes at locations x and y as binary integers, adds them, 
and inserts the binary representation of their sum into the byte at location z. (We do 
not operate on integers greater than one byte in Jeagth and ignore the possibility of 
overflow.) Here again, x and y are used to reference memory contents, whereas z is 
used to reference a memory location, but the proper interpretation is clear from the 
context, n 

Sometimes itis desirable toiadd a quantity to an address to obtain another address. 
For example, if a is a location in memory, we might want to reference the location four 
bytes beyond a. We cannot refer to this location as a +4, since that notation is reserved 
for the integer contents of location a +4. We therefore introduce the notation a[4] to 
refer to this location. We also introduce the notation a[.x] to refer to the address given 
by adding the binary integer contents of the byte at.x to the address a: | 

The MOVE instruction requires the programmer to specify the length of the string 
to be copied. Thus, its operand is a fixed-length character string (that is, the length of 
the suing must be known). A fixed-length string and a byte-sized binary integer may 
be considered native data types of this particular machine. 

Suppose that we wished to implement varying-length character strings on this 
machine. That is, we want to enable programmers to use an instruction 


MOVEVAR(source dest) 


to move a-character string from location source to location dest without being required 
to specify any length. š 

To implement this new data type. we must first decide on how it is to be repre- 
sented in the memory of the machine and then indicate how that representation is to 
be manipulated. Clearly, it is necessary to know how many bytes must be moved to 
execute this instruction. Since the MOVEVAR operation does not specify this number, 
the number must be contained within the representation of the character string itself. 
A varying-length character string of length / may be represented by a contiguous set 
of | + 1 bytes (J < 256). The first byte contains the binary representation of the length 
l and the remaining bytes contain the representations of the characters in the string. 
Representations of three such strings are illustrated in Figure 1.1.2. (Note that the dig- 
its Sand 9 in these figures do not stand for the bit patterns representing the characters i 
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Figure 1.1.2 Varying-length character strings. 


and ‘9° but rather for the patterns 00000161 and 00001001 (assuming eight bits to a 
byte). which represent the integers five and nine. Similarly, 14 iñ Figure 1.1.2c stands 
for the bit pattern 00001110. Note also that this representation is very different from 
the way character strings are actually implemented in C.) 

The program to implement the MOVEVAR operation can be written as follows (i 
is an auxiliary memory location): 


MOVE(source, dest, 1); 
for (i=1; i < dest; i++) 
MOVE(source[i], dest[i], 1); 


Similarly, we can implement an operation CONCATVAR(c1,c2,c3) to concate- 
nate two varying-length character strings at locations cl and c2 and place the result at 
c3. Figure 1.1.2c illustrates the concatenation of the two strings in Figure 1.1.2a and b: 


j* move the length 2y 
Z=cl+ 2; 
MOVE(z, c3, 1); 
/* move the first string */ 
for (i = 1; i <= cl; MOVE(c1[i], c3fi], 1); 
A& move the second string */ 
for (i = 1; i <= c2) { 
x=cl+i; 
MOVE(c2[i], c3[x], 1); 
} /* end for */ 


However, once the operation MOVEVAR has been defined, CONCATVAR can be im- 
plemented using MOVEVAR as follows: 
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MOVEVAR(c2, c3[c1]); /* move the second string */ 


MOVEVAR(c1, c3); j/* move the first string *] 
z= cl +2; /* update the length of the result */ 
MOVE(z, c3, 1); 


Figure 1.1.3 illustrates phases of this operation or: the strings of Figure 1.1.2. Although 
this latter version is shorter, it is not really more efficient, since all the instructions used 
in implementing MOVEVAR are performed each time that MOVEVAR is used. 

The statement z = cl + c2 in both the preceding algorithms is of particular inter- 
est. The addition instruction opcrates independently of the use of its operands (in this 
case, parts of varying-length character strings). The instruction is designed to treat its 
operands as single-byte integers regardless of any other use that the programmer has 
for them. Similarly, the reference to ¢3[{c1] is to the location whose address is given by 
adding the contents of the byte at location cl to the address c3. Thus the byte at cl is 
treated as holding a binary integer, although it is also the start of a varying-length char- 
acter strin 2. This illustrates the fact that a data type is a method of treating the contents 
of memory and that those contents have no intrinsic meaning. 

Note that this representation of varying-length character strings allows only 
strings wnose length is less than or equal to the largest binary integer that fits into a 
single byte. If a byte is eight bits, this means that the largest such string is 255 (that 
is, 25 — 1) characters long. To allow for longer strings, « different representation must 
be chosen and a new set of programs must be written. If we use this representation of 
varying-length character strings, the concatenation operation is invalid if the resulting 
string is more than 255 characters long. Since the result of such an operation is unde- 
fined, a wide variety of actions can be implemented if that operation is attempted. One 
possibility is to use only the first 255 characters of the result. Another possibility is to 
ignore the operation entirely and not move anything ‘to the result field. There is also 
a choice of printing a warning message or of assuming that the user wants to achieve 
whatever result the implementor decides on. 

In fact, C uses an entirely different implementation of character strings that avoids 
this limitation on the length of the string. In C, all strings are terminated by the special 
character ‘\ 0°. This character, which never appears within a string, is automatically 
placed by the compiler at the end of every string. Since the length of the string is not 
known in advance, all string operations must proceed a character at a time until *\ 0’ is 
encountered. 

The program to implement the MOVEVAR operation, under this implementation, 
can be written as follows: 


i=0; 
while Garett Y= '\o') { 
MOVE(source[i], destji], 1); 
I++; 
} 
dest[i}] = '\0'; 
/* terminate the destination string with '\0' */ 
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(a) MOVEVAR (C2, C3[C1]); =~ 


(b) MOVEVAR (CI, C3); 


(c) Z = C1 + C2; MOVE (Z, C3, 1); 


Figure 1.1.3  CONCATVAR operations. 
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To implement the concatenation operation, CONCATVAR(c1I ,c2,c3), we may write 


1.2 0; ri i 
/* move the first string */ 
while (c1[i] != '\0') { 
MOVE(c1[i], c3[i], 1); 
i++; 
o | ; 
/* move the second string */ 
j= 0; 
while (c2[j] != '\0') 
MOVE(c2[j++], c3[i++], 1); 
/* terminate the destination string with a \0 */ 
Bi] ="\0'; 


A disadvantage of the C implementation of character strings is that the length of a char- 
acter string is not readily available without advancing through the string one character 
at a time until *\ 0° is encountered. This is more than offset by the advantage of not 
having an arbitrary limit placed on the length of the string. J 3 

Once a representation has been chosen for objects of a particular data type and 
routines have been written to operate on those representations, the programmer is free to 
use that data type to solve problems. The original hardware of the machine plus the pro- 
grams for implementing more complex data types than those provided by the hardware 
can be thought of as a “better” machine than the one consisting of the hardware alone. 
The programmer of the original machine need not worry about how the computer is 
designed and what circuitry is used to execute each instruction. The programmer need 
know only what instructions are available and how those instructions can be used. Sim- 
ilarly, the programmer who uses the “extended” machine (that consists of hardware and 
software), or “virtual computer.” as it is sometimes known, need not be concerned with 
the details of how various data types are implemented. All the programmer needs to 
know is how they can be manipulated. 


y 


Abstract Data Types 


A useful tool for specifying the logical properties of a data type is the abstract data 
type, or ADT. Fundamentally, a data type is'a collection of values and a set of operations 
on those values. That collection and those operations form a mathematical construct that 
may be implemented using a particular hardware or software data structure. The term 
“abstract data type” refers to the basic mathematical concept that defines the data type. 

In defining an abstract data type as a mathematical concept, we are not concerned 
with space or time efficiency. Those are implementation issues. In fact. the definition + 
of an ADT is not concerned with implementation details at all. It may not even be 
possible to implement a particular ADT on a particular piece of hardware or using a 
particular software system. For example, we have already seen that the ADT inreger is 
not universally implementable. Nevertheless, by specifying the mathematical and logi- 
cal properties of a data type or structure, the ADT is a useful guideline to implementors 
and a useful tool to programmers who wish to use the data type correctly. i 
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There are a number of methods for specifying an ADT. The method that we use is 
semiformal and borrows heavity from C notation but extends that notation where nec- 
essary. To illustrate the concept of an ADT and our specification method, consider the 
ADT RATIONAL, which corresponds to the mathematical concept of a rational number. 
A rational number is a number that can be expressed as the quotient of two integers. 
The operations on rational numbers that we define are the creation of a rational number 
from two integers, addition, multiplication, and testing for equality. The following is an 
initial specification of this ADT: 


/*value definition*/ 
abstract typedef <integer, integer> RATIONAL; 
condition RATIONAL[1] != 0; 
/*operator definition*/ 
abstract RATIONAL makerational(a,b) 
int a,b; 
precondition b != 0; 
postcondition makerational(0] == a; 
makerational(1) == b; 


abstract RATIONAL add(a,b) /* written a+ b */ 
RATIONAL a,b; 
postcondition add[1] == a[l; * b[1]; 

add[0] == a[0] * b[1] + b[0] * a[l]; 


abstract RATIONAL mult(a, b) /* written a * b */ 
RATIONAL a,b; 
postcondition mult[0] == a[0] * b[0]; 

mult{1) == a[l] * b[1]; 


abstract equal(a,b) /* written a == b */ 
RATIONAL a,b; 


Postcondition equal == (a[0]*b[1] == b[0)*a[1}); 


An ADT consists of two parts: a value definition and an operator definition. The 
value definition defines the collection of values for the ADT and consisis of two parts: a 
definition clause and a condition clause. For example, the value definition for the ADT 
RATIONAL states that a RATIONAL value consists of two integers, the second of which 
does not equal 0. Of course, the two integers that make up a rational number are the 
numerator and the denominator. We use array notation (square brackets) to indicate the 
parts of an abstract type. _ 

The keywords abstract typedef introduce a value definition, and the keyword con- 
dition is used to specify any conditions on the newly defined type..In this definition, the 
condition specifies that the denominator may not be 0. The definition clause is required, 
but the condition clause may not be necessary for every ADT. 

Immediately following the value detinition comes the operator definition. Each 
operator is defined as an abstract function with three parts: a header, the opticnal pre- 
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conditions, and the postconditions. For example, the operator definition of the ADT RA- 
TIONAL includes the operations of creation (makerational), addition (add) and multi- 
plication (mult), as well as a test for equality (equal). Let us consider the specification 


for multiplication first, since it is the simplest. It contains a header and postconditions, 
but no preconditions: 


abstract RATIONAL mult(a,b) /* written a*b */ 
RATIONAL a,b; 
postcondition mult[0] == a[0]*b[0); 

muit({i) == a{1)*b[1); 


The header of this definition is the first two lines, which are just like a C function header. 
The keyword abstract indicates that this is not a C function but an ADT operator defi- 
nition. The comment beginning with the new keyword written indicates an alternative 
way of writing the function. 

The postcondition specifies what the operation does. In a postcondition, the name 
of the function (in this case, mult) is used to denote the result of the operation. Thus, 
mult{O] represents the numerator of the result, and mult| 1] the denominator of the result. 
That is, it specifies what conditions become true after the operation is executed. In 
this example, the postcondition specifies that the numerator of the result of a rational 
multiplication equals the integer product of the numerators of the two inputs, and that 
the denominator equals the integer products of the two denominators. 

The specification for addition (add) is straightforward and simply states that 

a0 b0 _ a0*bl +al*b0 
al bl al *bl 

The creation operation (makerational) creates a rational number from two inte- 
gers and contains the first example of a-precondition. In general, preconditions specify 
any restrictions that must be satisfied before the operation can be applied. In this ex- 
ample, the precondition states that makerational cannot be applied if its second param- 
eter is 0. 

The specification for equality (equal) is more significant and more complex in 
concept. In general, any two values in an ADT are “equal” if and only if the values 
of their components are equal. Indeed, it is usually assumed that an equality (and an 
inequality) operation exists and is defined that way, so that no explicit equal operator 
definition is required. The assignment operation (setting the value of one object to the 
value of another) is another example of an operation that is often assumed for an ADT 
and is not specified explicitly. ` 

However, for some data types, two values with unequal components may be con- 
sidered equal. Indeed, such is the case with rational numbers; for example, the rational 
numbers 1/2, 2/4, 3/6, and 18/36 are all equal despite the inequality of their components. 
Two rational numbers are considered equal if their components are equal when the num- 
bers are reduced to lowest terms (that is, when their numerators and denominators are 
both divided by their greatest common divisor). One way of testing for rational equality 
is to reduce the two numbers to lowest terms and then test for equality of numerators 
and denominators. Another way of testing for rational equality is to check whether the 
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cross products (that is, the numerator of oné times the denominator of the other) are 
equal. This is the raethod that we used in specifying the abstract equal operation. 

The abstract specification illustrates the role of an ADT as a purely. logical defi- 
nition of a new data type. As collections of two integers, two ordered pairs are unequal 
if their components are not equal: yet as rational numbers, they may be equal. It is un- 
likely that any implementation of rational numbers would implement a test for equality 
by actually forming the cross products; they might be too large to represent as machine 
integers. Most likely, an implementation would first reduce the inputs to lowest terms 
and then test for component equality. Indeed, a reasonable implementation would in- 
sist that makerational, add, and mult only produce rational numbers in lowest terms. 
However, mathematical! definitions such as abstract data type specifications need not 
be concerned with implementation details. 

In fact, the realization that two rationals can be equal even if they are component- 


wise unequal forces us to rewrite the postconditions for makerational, add, and mult. 
That is, if 


mO — -a0 x b0 
m al bi 
it is not necessary that m0 equal a0 * bO and that m1 equal al * b1, only that mO * al 


* bl equal.ml * «u0 * b0. A more accurate ADT specification for RATIONAL is the 
following: 


/*value definition*/ 
abstract typedef<int, int> RATIONAL; 
condition RATIONAL[1} != 0; 


/*operator definition*/ 

abstract equal(a,b) /* written a == b*/ 
RATIONAL a,b; 

postcondition equal == (a{0]*b[1] == b[0]*a[1]}); 


abstract RATIONAL makerational (a,b) /* written [a,b}*/ 
int a,b; 

precondition b != 0; 

postcondition makerational[0}*b == a*makerational[{1] 


abstract RATIONAL add(a,b) /* written a + b */ 
RATIONAL a,b; 


postcondition add == [a[0] * b[1] + b{0} * af), af1)*b[1]] 


abstract RATIONAL mult(a,b) /* written a * b */ 
RATIONAL a,b; 


postcondition mult == [a[0] * b[0), a[l] * bf} ] 


Here, the equal operator is defined first. and the Operator = = is extended td 
rational equality using the written clause. That operator is then used to specify the 
results of subsequent rational operations (add and mult). 


Introduction to Data Structures Chap. 1 
16 


The result of the makerational operation on the integers a and b produces a ratio- 
nal that equals a/b, but the definition does not specify the actual.values of the resulting 
numerator and denominator. The specification for makerational also introduces the no- 
tation [a,b] for the rational formed from integers a and b, and this notation is then used 
in defining add and mult, : 

The definitions of add and mult specify that their results equal the unreduced re- 
sults of the corresponding operation, but the individual components are not necessarily 
equal. . 

Notice, again, that in defining these operators we are not specifying how they are 
to be computed, only what their result must be. How they are computed is determined 
by their implementation, not by their specification. 


' Sequences as Value Definitions 


In developing the specifications for various data types, we often use set-theoretic 
notation to specify the values of an ADT. In particular, it is helpful to use the notation 
of mathematical sequences that we now introduce, 


A sequence is simply an ordered set of elements. A sequence S is sometimes 
written as the enumeration of its elements, such as 


S= 150, Sh ikasi 


If S contains n elements, S is said to be of length n. We assume the existence of 
a length function len such that len(S) is the length of the sequence S. We also assume 
functions first(S), which returns the value of the first element of S (So in the foregoing 
example), and /ast(S), which returns the value of the last element of S (s,-; in the 
foregoing example). There is a special sequence of length 0, called nilseq, that contains 
no elements. first(nilseq) and last(nilseq) are undefined. 

We wish to define an ADT stp] whose values ure sequences of elements. If the 


sequences can be of arbitrary length and consist of elements all of which are of the same 
type, tp, then stp1 can be defined by ‘ 


abstract typedef <<tp>> stpl; 


Alternatively, we may wish to define ah ADT stp2, whose values are sequences of 


fixed length whose elements are of specific types. In such a case, we would specify the 
definition 


abstract typedef <tp0, tpl, tp2, ..., tpn> stp2; 


Of course, we may want to specify a sequence of fixed length all of whose elements are 
of the same type. We could then write 


abstract typedef <<tp,n>> stp3; 


In this case stp3 represents a sequence of length n, all of whose elements are of type tp. 
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For example, using the foregoing notation we could define the foHowing types: 


abstract typedef <<int>> intseq; 
/* sequence of integers of */ 
j * - any length 4f 
abstract typedef <integer, char, float> seq3; 
/* sequence of length 3 = */ 
/* consisting of an integer, */ 
/* a character and a */ 
/* floating-point number */ 


abstract typedef <<int,10>> intseq; 
i /* sequence of 10 integers */ 


abstract typedef <<,2>> pajr; 
/* arbitrary sequence of */ 
/* length 2 baie, | 


Two sequences are equal if each element of the first is, equal to the corresponding el- 
ement of the second. A subsequence is a contiguous portion of a sequence. If Sisa 
sequence, the function sub(S,i, j) refers to the subsequence of-S starting at position i in 
S and consisting of j consecutive elements. Thus if T equals sub(S,i,k), and T is the se- 
quence < to, fi, .-.5%ke-1 >, fo = Sint) = Sits cees fk-1 = Si+k-1- If i is not between 
0 and Jen(S) — k, then sub(S,i,k) is defined as nilseq. l 

The concatenation of two sequences, written S + T, is the sequence consisting 
of all the elements of S followed by all the elements of T. It is sometimes desirable to 
specify insertion of an element in the middle of a sequence. place(S,i, x) is defined as 
the sequence S$ with the element x inserted immediately following position i (or into 
the first element of the sequence if i is — 1). All subsequent elements are shifted by one 
position. That is, place(S,i,x) equals sub(S,0,i + 1) + <x > + sub(S,i +1, len(S) — 
i—i): 

Deletion of an element from a sequence can be specified in one of two ways. If x is 
an element of sequence S, S — < x > represents the sequence S without all occurrences 
of element x. The sequence delete(S,i) is equal to S with the element at position i deleted. 
delete(S,i) can also be written in terms of other operations as sub(S,0,i) + sub(S,i + 
1,len(S) — i- 1). 


ADT for Varying-length Character Strings 


* Asan illustration of the use of sequence notation in defining an ADT, we develop 
an ADT specification for the varying-length character string. There are four basic oper- 


ations (aside from equality and assignment) normally included in systems that support 
such strings: 


length a function that returns the current length of the string 
concat a function that returns the concatenation of its two input strings 
substr a function that returns a substring of a given string 
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pos a function that returns the first position of ene string asa 
substring of another 


abstract typedef <<char>> STRING; 


abstract length(s) 
STRING s; 
postcondition length == len(s); 


abstract STRING concat(s1,s2) 
STRING 51,52; 
postcondition concat == sl + s2; 


abstract STRING substr(sl,i,j)’ 
STRING sl; 
int i,j; 
precondition 0 <= i < len(s1); 

0 <= j <= len(sl) - i; 
postcondition substr == sub($1,i,j); 


abstract pos(s1, s2) 
STRING s1,s2; y : 
postcondition /*lastpos = len(sl) - len(s2) */ 
((pos == -1) && (for(i = 0; 
; i <= lastpos; i++) 
(s2 <> sub(s1,i, len(s2))))) 
lI 
((pos >= 0) & (pos <= lastpos) 
&& (s2 == sub(strl,pos, len(s2)) 
&& (for(i = 1; i < pos; i+) 
(s2 < sub(sl,i,]en(s2))))); 


The postcondition for pos is complex and introduces some new notation, so we 
review it here. First, note the initial comment whose content has the form of a C as- 
signment statement. This merely indicates that we wish to define the symbol lastpos as 
representing the value of len(s1) — len(s2) for use within the postcondition to simplify 
the appearance of the condition. Here, /astpos represents the maximum possible value 
of the result (that is, the last position of s1 where a substring whose length equals that 
of s2 can start). lastpos is used twice within the postcondition itself. The longer expres- 
sion len(s1) — len(s2) could have been used in both cases, but.we chose to use a more 
compact symbol (/astpos) for clarity. 

The postcondition itself states that one of two conditions must hold. The two con- 
ditions, which are separated by the || operator, are as follows: 


1. The function's value (pos) is —1, and s2 does not appear as a substring of s1. 
2. The function’s value is between 0 and /astpos@s2 does appear as a substring of s1 


beginning at the function value’s position, and s2 does not appear as a substring 
of sl in any earlier position. 
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Note the use of a pseudo-for gop in a condition. The condition 


for (i = x; i <= y; i++) 
(condition(i)) 


is true if condition(i) is true for all i from x to y inclusive. It is also true if x > y. 
Otherwise, the entire for-condition is false. 


Data Types in C 


The C language contains four basic data types: int, float, char and double. In most 
computers, these four types are native to the machine’s hardware. We have already 
seen how integers, floats, and characters can be implemented in hardware. A double 
variable is a double-precision floating-point number. There are three qualifiers that can 
be applied to ints: short, long, and unsigned. A short or long integer variable refers to 
the maximum size of the variable’s value. The actual maximum sizes implied by short 
int, long,int, or int vary from machine to machine. An unsigned integer is an integer 
that is always positive and follows the arithmetic laws of modulo 2", where n is the 
number of bits in an integer. . ar a 

A variable declaration in C specifies two things. First, it specifies the amount of 
storage that must be set aside for objects declared with that type. For example, a ` ‘ariable 
of type int must have enough space to hold the largest possible integer value. Second, it 
specifies how data represented by strings of bits are to be interpreted. The same bits at 
a specific storage location can be interpreted as an integer of a floating-point number, 
yielding two completely different numeric values. 

A variable declaration specifies that storage be set aside for an object of the speci- 


fied type and that the object at that storage location can be referenced with the specified 
variable identifier. 


Pointers in C 


In fact, C allows the programmer to reference the location of objects as well as 
the objects (that is, the contents of those locations) themselves. For example, if x is 
declared as an integer, &x refers to the location that has been set aside to contain x. &x 
is called a pointer. 

` Itis possible to declare a variable whose data type is a pointer and whose possible 
values are memory locations. For example, the declarations 

int *pi; 

float *pf; 

char *pc; 


declare three pointer variables: pi is a pointer to an integer, pf is a pointer to a float 
number, and pc is a pointer to a character. The asterisk indicates that the values of the 
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variables being declared are pointers to values of the type specified in the declaration 
rather than objects of that type. 

A pointer is like any other data type in C in many respects. The value of a pointer 
is a memory location in the way that the value of an integer is a number. Pointer values 
can be assigned like any other values. For example, the statement pi = &x; assi: 1S a 
pointer to the integer x to the pointer variable pi. 

The notation *pi in C refers to the integer at the location referenced by the pointer 
pi. The statement x = *pi; assigns the value of that integer to the integer variable x. 

Note that C insists that a declaration of a pointer specify the data-type to which 
the pointer points. In the foregoing declarations, each of the variables pi, pf, and pc are 
pointers to a specific data type: int, float, and char, respectively. The type of pi is not 
simply “pointer” but “pointer to an integer.” In fact, the types of pi and pf are different: 
pi is a pointer to an integer, and pf is a pointer to a float number. Each data type df 
in C generates another data type, pdt, called “pointer to dt.” We call dt the base type 
of pdt. 


The conversion of pf from the type “pointer to a float number” to the type “pointer 
to an integer” can be made by writing 


pi = (int *) pf; 


where the cast (int *) converts the value of pf to the type “pointer to an int,” or “int *.” 

The importance of each pointer being associated with a particular base type be- 
comes clear in reviewing the arithmetic facilities that C provides for pointers. If pi is 
a pointer to an integer, then pi + | is the pointer to the integer immediately following 
the integer *pi in memory, pi — 1 is the pointer to the integer immediately preceding 
*pi, pi + 2 is the pointer to the second integer following *pi, and so.on. For example, 
suppose that a particular machine uses byte addressing, an integer requires four bytes, 
and the value of pi happens to be 100 (that is, pi points to the integer *pi at location 
100). Then the value of pi — 1 is 96, the value of pi + 1 is 104 and the value of pi + 
2 is 108. The value of *(pi — 1) is the contents of the four bytes 96, 97, 98, and 99 
interpreted as an integer; the value of *(pi + 1) is the contents of bytes 104, 105, 106, 
and 107 interpreted as an integer; and the value of *(pi + 2) is the integer at bytes 108, 
109,110, and 111. ` ; 

Similarly, if the value of the variable pc is 100 (recall that pc is a pointer to a 
character) and a character is one byte long, pc — 1 refers to location 99, pe + 1 to 
location 101, and pe + 2 to location 102. Thus the result of pointer arithmetic in C 
depends on the base type of the pointer. i 

Note also the difference between *pi + 1, which refers to 1 added to the integer 
pi, and *(pi + 1), which refers to the integer following the integer at location pi. 

One area in which C pointers play a prominent role is in passing parameters to 
functions. Ordinarily, parameters are passed to a C function by value, that is, the values 
being passed are copied into the parameters of the called function at the time the func- 
tion is invoked, If the value of a parameter is changed within the function, the value 
in the calling program is not changed. For example, consider the following program 
segment and function (the line numbers are for reference only): 
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1. yum 55 

2 printf("%d\n", x); 
3) funct(x); 

4 printf("%d\n", x); 


5 void funct(int y) 

6 { 

7 ++y; 

8 printf("%d\n", y); 
9 } /* end funct */ 


Line 2 prints 5 and then line 3 invokes funct. The value of x, which is 5, is copied into 
y and funct begins execution. Line 8 then prints 6 and funct returns. However, when 
line 7 increments the value of y, the value of x remains unchanged. Thus line 4 prints 
5. x and y refer to two different variables that happen to have the same value at the 
beginning of funct. y can change independently of x. 


If we wish to use func? to modify the value of x, we must pass the address of x as 
follows: 


Xe! 5i 


il 
2 printf("%d\n", x); 
3 funct(&x); 
4 printf("%d\n", x); 
5 void funct(int *py) 
6 { 
sme g ++(*py); 
8 printf("%d\n", *py); 
9: } /* end funct */ 


Line 2 again prints 5 and line 3 invokes funct. Now, however, the value passed is not 
the integer value of x, but the pointer value &x. This is the address of x. The parameter 
of funct is no longer y of type int but py of type int *. (It is convenient to name pointer 
variables beginning with the letter p as a reminder to both the programmer and"the 
program reader that it is a pointer. However, this is not a requirement of the C language 
and we could have named the pointer parameter y.) Line 7 now increments the integer 
at location py. py, itself, however, is not changed and retains its initial value &x. Thus 
py points to the integer x, so that when *py is incremented, x is incremented. Line 8 
prints 6 and when funct returns, line 4 also prints 6. Pointers are the mechanism used 
in C4 allow a called function to modify variables in a calling function. 


Data Structures and C 


A C programmer can think of the C language as defining a new machine with 
its own capabilities, data types, and operations: The user can state a problem solution 
in terms of the more useful C constructs rather than in terms of lower-level machine- 

i 
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language constructs. Thus, problems can be solved more easily because a larger set of 
tools is available. 

The study of data structures therefore involves two complementary goals. The 
first goal is to identify and develop useful mathematical entities and operations and to 
determine what classes of problems can be solved by using these entities and opera- 
tions. The second goal is to determine representations for those abstract entities and to 
implement the abstract operations on these concrete répresentations. The first of these 
goals views a high-level data type as a tool that can be used to solve other problems, 
and the second views the implementation of such a data type as a problem to be solved 
using aiready existing data types. In determining representations for abstract entities, 
we must be careful to specify what facilities are available for constructing such repre- 
sentations. For example, it must be stated whether the full C language is available or 
whether we are restricted to the hardware facilities of a particular machine. 

In Sections 1.2 and 1.3 we examine several data structures that already exist in C: 
the array and the structure, We describe the facilities that are available in C for utilizing 
these structures. We also focus on the abstract definitions of*these data structures and 
how they can be useful in problem solving. Finally, we examine how they could be 
implemented if C were not available (although a C programmer can simply use the _ 
data structures as defined in the language without being concerned with most of these 
implementation details). 

In the remainder of the book, we develop more complex data structures and show 
their usefulness in problem solving. We also show how to implement these data struc- 
tures using the data structures that are already available in C. Since the problems that 
arise in the course of attempting to implement high-level data structures are quite com- 
plex, this will also allow us to investigate the C language more thoroughly and to gain 
valuable experience in the use of this language. 

i Often no implementation, hardware or software, can model a mathematical con- 
cept completely. For example, it is impossible to represent arbitrarily large integers on 
a computer, since the size of such a machine’s memory is finite. Thus, it is not the data 
type “integer” that is represented by the hardware but rather the data type “integer be- 
tween x and y,” where x and y are the smallest and largest integers representable by that 
machine. 

It is important to recognize the limitations of a particular implementation. Often 
it will be possible to present several implementations of the same data type, each with 
its own strengths and weaknesses. One particular implementation may be better than 
another for a specific application, and the programmer must be aware of the possible 
trade-offs that might be involved. 

One important consideration in any implementation is its efficiency. In fact, the 
reason that the high-level data structures that we discuss are not built into C is the 
significant overhead that they would entail. There are languages of significantly higher 
level than C that have many of these data types already built into them, but many of 
them are inefficient and are therefore not in widespread use. 

Efficiency is usually measured by two factors: time and space. If a particu- 
lar application is heavily dependent on manipulating high-level data structures. the 
speed at which those manipulations can be performed will be the major determinant of 
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the speed of the entire application. Similarly, if a program uses a large number of such 
structures, an implementation that uses an inordinate amount of space to represent the 
data structure will be impractical. Unfortunately, there is usually a trade-off between 
these two efficiencies, so that an implementation that is fast uses more storage than one 
that is slow. The choice of implementation in such a case involves a careful evaluation 
of the trade-offs among the various possibilities. 


EXERCISES 


1.1.1. In the text, an analogy is made between the length of a line and the number of bits of 
information in a bit string. In what ways is this analogy inadequate? 

1.1.2. Determine what hardware data types are available on the computer at your particular 
installation and what operations can be performed on them. 


1.1.3. Prove that there are 2” different settings for n two-way switches. Suppose toat we wanted 
to have m settings. How many switches would be necessary? 


1.1.4. Interpret the following bit settings as binary positive integers, as binary integers in twos 
complement, and as binary coded decimal integers. If a setting cannot be interpreted as 
a binary coded decimal integer, explain why. 

(a) 10011001 (da) 01110111 
(b) . 1001 (e) 01010101 
(c) 000100010001 (£) .100000010101 

1.1.5. Write C functions add, subtract, and multiply that read two strings of Os and 1s represent- 
ing binary nonnegative integers, and print the string representing their sum, difference, 
and product, respectively. 

1.1.6. Assume a ternary computer in which the basic unit of memory is a “trit” (ternary digit) 
rather than a bit. Such a trit can have three possible settings (0, 1, and 2) rather than just 
two (0 and 1). Show how nonnegative integers can be represented in ternary notation 
using such trits by a method analogous to binary notation using bits. Is there any non- 
negative integer that can be represented using ternary notation and trits that cannot be 
represented using binary notation and bits? Are there any that can be represented using 
bits that cannot be represented using trits? Why are binary computers more common 
than ternary computers? 

1.1.7. Write a C program to read a string of Os and 1s representing a positive integer in binary 
and print a string of Os, 1s, and 2s representing the same number in ternary notation (see 
the preceding exercise). Write another C program to read a ternary number and print the 
equivalent in binary. 

1.1.8, Write an ADT specification for complex numbers a + bi, where abs(a + bi) is sqrt(a? + 
b°), (a + bi) + (c + di) is (a + c) + (b + d)i, SaaS a bad) t 
(a*d+b*c)i, and —(a + bi) is (—a) + (b)i. 


1.2 ARRAYS IN C 


In this section and the next we examine several data structures that are an invaluable 
part of the C language. We will see how to use these structures and how they can be 


24 Introduction to Data Structures Chap. 1 


lemented. These structures are composite or structured data types; that is, they are 
made up of simpler data structures that exist in the language. The study of these data . 
structurés involves an analysis of how simple structures combine to form the composite ` 
and how to extract a specific component from the composite. We expect that you have 
already seen these data structures in an introductory C programming course and that you 
are aware of how they age defined and used in C. In these sections, therefore, we will 
not dwell on the marty details associated with these structures but instead will highlight 
those features that are interesting from a data-structure point of view. 
` The first of these data types is the array. The simplest form of array is a one- 
dimensional array that may be defined abstractly as a finite ordered set of homogeneous 
elements. By “finite” we mean that there is a specific number of elements in the array. 
This number may be large or small, but it must exist. By “ordered” we mean that the 
elements of the array are arranged so that there is a zeroth, first, second, third, and so 
forth. By “homogeneous” we mean that all the elements in the array must be of the 
same type. For example, an array may contain all integers or all characters but may not 
contain both. 
. However, specifying the form of a data structure does not yet completely describe 


the structure. We must also specify how the structure is accessed. For example, the C 
declaration ; 


int a{100]; 


specifies an array of 100 integers. The two basic operations that, access an array are 
extraction and storing. The extraction operation is a function that accepts an array, a, 
and an index, i, and returns an element of the array. In C, the result of this operation is 
denoted by the expression a[i]. The storing operation accepts an array, a, an index, i, 
and an element, x. In C this operation is denoted by the assignment statement a(i] = x. 

` The operations are defined by the rule that after the foregoing assignment statement has 
been executed, the value of ali] is x. Before a value has been assigned to an element of 
the array, its value is undefined and a reference to it in an expression is illegal. 

The smallest element of an array’s index is called its lower bound and in C is 
always 0, and the highest element is called its upper bound. If lower is the lower bound 
of an array and upper the upper bound, the number of elements in the array, called ‘its 
range, is given by upper — lower + 1. For example, in the array, a, declared previously, 
the lower bound is 0, the upper bound is 99, and the range is 100. 

An important feature of a C array is that neither the upper bound nor the lower 
bound (and hence the range as well) may be changed during a program's execution. The 
lower bound is always fixed at 0, and the upper bound is fixed at the time the program 
is written. j 

One very useful technique is to declare a bound as a constant identifier, so that 
the work required to modify the size of an array is minimized. For example, consider 
the following program segment to declare and initialize an array: 


ë int a[100]; é 
for(i = 0; i < 100; a[i+] = 0); 
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To change the array to a larger (or smaller) size, the constant 100 must be changed int 


Places: once in the declarations and once in the for statement. Consider the following 
equivalent alternative: ; 


#define NUMELTS 100 
int a[NUMELTS]; 
for(i = 0; i < NUMELTS; a[i++] = 0); 


Now only a single change in the constant definition is needed to change the upper bound. 


The Array as an ADT 


We can represent an array as an abstract data type with a slight extension of the 
conventions and notation discussed earlier. We assume the function type(arg), which 
returns the type of its arguinent, arg. Of course, such a function cannot exist in C, 
since C cannot dynamically determine the type of a variable. However, since we are 
not concerned here with implementation, but rather with specification, the use of such 
a function is permissible. 

Let ARRTYPE(ub,eltype) denote the ADT corresponding to the C array type el- 
type array(ub]. This is our first example of a parameterized ADT, in which the pre- 
cise ADT is determined by the values of one or more parameters. In this casé, ub and 
eltype are the parameters; note that eltvpe is a type indicator, not a value. We may 
now view any one-dimensional array as an entity of the type ARRTYPE. For example, 
ARRTYPE(10,int) would represent the type of the array x in the declaration int x{10}. - 


We may now view any one-dimensional array as an entity of the type ARRTYPE. The 
specification follows: 


abstract typedef <<eltype, ub>> ARRTYPE(ub, eltype); 
condition type(ub) == int; 


abstract eltype extract(a, i) /* written a[i] */ 
ARRTYPE(ub, eltype) a; 

int i; 

precondition 0 <= i < ub; 

postcondition extract == a; 


abstract store(a, i, elt) /* written a[i] = elt */ 

ARRTYPE (ub, eltype) a; 

int 7; 

eltype elt; s 
-sprecondition 0 <= i < ub; 

postcondition a[i] == elt; 


The store operation is our first example of an operation that modifies one of its 
parameters; in this case the array a. This is indicated in the postcondition by specifying 
the value of the array element to which elt is being assigned. Uniess a modified value 
is specified in a postcondition, we assume that all parameters retain the same value af- 
ter the operation is applied in a postcondition as before. It is not necessary to specify that 
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such. values remain unchanged, Thus, in this example, all array elements other than the 
one to which elt is assigned retain the same values. 

Note that once the operation extract has been defined, together with its bracket 
notation, a[i], that notation can be used in the postcondition for the subsequent store 
operation specification. Within the postcondition of extract, however, subscripted se- 
quence notation must be used, since the array bracket notation itself is being defined. 


Using One-Dimensional Arrays 


A one-dimensional array is used when it is necessary to keep a large number of 
items in memory and reference all the items in a uniform manner. Let us see how these 
two requirements apply to practical situations. . 

Suppose that we wish to read 100 integers, find their average, and determine by 
how much each integer deviates from that average. The following program accom- 
plishes this: 


#define NUMELTS 100 
void main() 


{ « 
int num[NUMELTS] ; /* array of numbers */ 
int i; 
int total; /* sum of the numbers */ 
float avg; /* average of the numbers */ 
float diff; /* difference between each £j 
/* number and the average | */ 

total = 0; , 
for (i = 0; i < NUMELTS; i++) { 

/* read the numbers into the array and add them if i 


scanf("%d", num[i]); 
total += num[i]; 
} /* end for */ 
avg = (float) total / NUMELTS; /* compute the average */ 
printf("\nnumber difference’); /* print heading +7 
/* print each number. and its difference */ 


for (i = 0; i < NUMELTS; i++) { 
diff = num[i] - avg; 
printf("\n %d %f", num[i], diff); 
} /* end for */ 
printf("\naverage is: %f", avg); 
} /* end main */ 


This program uses two groups of 100 numbers. The first group is the set of input 
integers and is represented by the array num, and the second group is the set of differ- 
ences that are the successive values assigned to the variable diff in the second loop. The 
question arises, why is an array used to hold all the values of the first group simultane- 
ously, whereas only a single variable is used to hold one value of the second group at a~ 
time? , 
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The answer is quite simple. Each difference is computed and printed and is never 
needed again. Thus the variable diff can be reused for the difference of the next integer 
and the average. However, the original integers that are the values of the array num 
must all be kept in memory. Although each can be added inte foral as it is input, it must 
be retained until after the average is computed in order for the program to compute the 
difference between it and the average. Therefore, an array is used. f 

Of course, 100 separate variables could have been used to hold the integers. The 
advantage of an array, however, is that it allows the programmer to declare only a single 
identifier and yet obtain a large amount of space. Furthermore, in conjunction with the 
Sor loop, it also allows the programmer to reference each element of the group in a 
uniform manner instead of forcing him or her to code a statement such as 


scanf ("%d%d%d. ..%d", &num0, &num1, &num2, ..., &num99); 


A particular element of an array may be retrieved through its index. For example, 
Suppose that a company is using a program in which an array is declored by 


int sales[10]; 


The array will hold sales figures for a ten-year period. Suppose that each line input to 
the program contains an integer from 0 to 9, representing a year as well as a sales figure 
for that year, and that it is desired to read the sales figure into the appropriate element 
of the array. This can be accomplished by executing the Statement 


scanf("%d%d", &yr, &sales[yr]); 


within a loop. In this statement, a particular element of the array is accessed directly by 
using its index. Consider the situation if ten variables s0, s1, ... , s9 had been declared. 
Then even after executing scanf(“%d" &yr) to set yr to the integer representing the year, 
the sales figure could not be read into the proper variable without coding something like 


switch(yr) { 


case 0: scanf("%d", &s0); 
case 1: scanf("%d", &s1); 


case 9: scanf("%d", &s9); 7 
} /* end switch */ 


This is bad enough with ten elements—imagine the inconvenience if there were a hun- 
dred or a thousand. 


Implementing One-Dimensional Arrays 
"i A one-dimensional array can be implemented easily. The C declaration 


int b[100]; 
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reserves 100 successive memory locations, each large enough to contain a single integer. 
The address of the first of these locations is called the base address of thé array b and 
is denoted by base(b). Suppose that the size of each individual element of the array 
is esize. Then a reference to the element b[0] is to the element at location base(b), 
a reference to b[1] is to the element at base(b) + esize, a reference to b[2] is to the 
element base(b) + 2 * esize. In general, a reference to b[i] is to the element at location 
base(b) + i * esize. Thus it is possible to reference any element in the array, given its 
index. 

In fact, in the C language an array biei is implemented asa pointer variable. 
The type of the variable bin the above declaration is “pointer to an integer” or int *. An 
asterisk does not appear in the declaration’because the brackets “automatically imply 
that the variable is a pointer. The difference between the declarations int *b; and int 
b[ 100]; is that:the latter also reserves 100 integer locations starting at location b. In 
C the value of the variable b is base(b), and the value of the variable b[i], where i is 
an integer, is *(b + i). Recall from Section 1.1 that, since b is a pointer to an integer, 
*(b + i) is the value of the ith integer following the integer at location b. b[i], the element 
at location base(b) + i * esize, is equivalent to the element pointed to by b + i, which 

is *(b + i). . 

In C all elements of an array have the same fixed, predetermined size. Some pro- 
gramming languages, however, allow arrays of objects of differing sizes. For example, 
a language might allow arrays of varying-lengih character strings. In such cases, the 
above method cannot be used to implement the array. This is because this method of 
calculating the address of a specific element of the array depends upon knowing the 
fixed size (esize) of each preceding element. If not all the elements ayer the same size, 
a different implementation must be used. : 

One method of implementing an array of varying-sized elements is to reserve 
a contiguous set of memory locations, each of which holds an address. The contents 
of each such memory location are the address of the varying-length array element in 
some other portion of memory. For example, Figure 1.2.1a illustrates an array of five 
varying-length character strings under the two implementations of varying-length in- 
tegers presented in Section 1.1. The arrows in the diagram indicate addresses of other 
portions of memory. The character ‘W’ indicates a blank. (However, in C a:string is it- 
self implemented as an array, so that an array of strings is actually an array of mpa 
two-dimensional rather than a one-dimensional array.) 

Since the length of each address is fixed, the location of the address of a ticiet 
element can be computed in the same way that the location of a fixed-length element 
was computed in the previous examples. Once this location is known, its contents can 
be used to dejermine the location of the actual array element. This, of course, adds an 
extra level of indirection to referencing an array element by involving an extra memory 
reference, which i in turn decreases efficiency. However, this is a small price to pay for 
the convenience of being able to maintain such an array. 

A similar method for implementing an array of varying-sized elements is to keep 
all fixed-length portions of the elements in the contiguous array area. in addition to 
keeping the address of the varying-length portion in the contiguous area. For example, 
in the implementation of varying-length character strings presented in the previous _ 
section. each such string contains a fixed-length portion (a one-byte length field) and 
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(a) 


Figure 1.2.1 Implementations of an array of varying-length strings. Contin- 
ues on page 31. 


BREE 
efelelle liiteli 


(b) 


Figure 1.2.1 Concluded. 


a variable-length portion {the character string itself). One implementation of an array 
of varying-length character strings keeps the length of the string together with the ad- 
dress, as shown in Figure 1.2.1b. The advantage of this method is that those parts of an 
element that are of fixed length can be examined without an extra memory reference. 
For example, a function to determine the current length of a varying-length character 
string can be implemented with a single memory lookup. The fixed-length information 
for an array element of varying length that is stored in the contiguous memory area of - 
the array is often called a header. 


Arrays as Parameters 


Every parameter of a C function must be declared within the function. However, 
the range of a one-dimensional array parameter is only specified in the main program. 
This is because in C new storage is not allocated for an array parameter. Rather, the 
parameter refers to the original array that was allocated in the calling program. For 


example, consider the following function to compute the average of the elements of an 
array: £ 


float avg(float a[], int size) /* no rangè is specified for the array a */ 
{ ; 

int i; 

float sum; 
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sum = 0; 
for (i=0; i < size; i++) 
sum += ali]; : ` 
return(sum / size); 
} /* end avg */ 


In the main program, we might have written 


#define ARANGE 100 
float a[ARANGE]; 


avg(a, ARANGE); 


Note that if the array range is needed in the function, it must be passed separately. 

Since an array variable in C is a pointer, array parameters are passed by reference 
rather than by value. That is, unlike simple variables that are passed by value, an array’s 
contents are not copied when it is passed as a parameter in C. Instead, the base address 
of the array is passed. If a calling function Contains the call funct(a), where a is an array 
and the function funct has the header 


void funct(int. b[]) 
the statement 

bli] = x; - 
inside funct modifies the value of a[i] inside the calling function. b inside funct refers 
to the same array of locations as a in the calling function. . 

Passing an array by. reference rather than by value is more efficient in both time 
and space. The time that would be required to copy an entire array on invoking a’ func- 
tion is eliminated. Also the space that would be needed for a second copy of the array 
in the called function is reduced to space for only a single pointer variable. 


Character Strings in C 


A string is defined in C as an array of characters. Each string is terminated by the, 


"5 ., NULL character, which indicates the end of the string. A string constant is denoted by 


` any set of characters included in double-quote marks. The NULL character is automat- 
ically appended to the end of the characters in a string constant when they are:stored. 
Within a program, the NULL character is denoted by the escape sequence \ Q. Other 
escape sequences that can be used are \ n for a new line character, \ t for a tab character, 
\ b for a backspace character, \”” for the double-quote character, \ \ for the backslash 
character, \ ° for the single-quote character, \r for the carriage return character and \ f 
for the form feed character. 
- A string constant represents an array whose lower bound is 0 and whose up- 
per bound jis the number of characters in the String. For example, the string “HELLO 
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THERE” is an array of twelve characters (the blank and \ 0 each counts as a character), 
and “I DON\ `T KNOW HIM” is an array of sixteen characters (the escape sequence 
\ ° represents the single-quote character). 


Character String Operations 


Let us present C functions to implement some primitive operations on character 
Strings. For all these functions, we assume the global declarations 


#define STRSIZE 80 
char str“ng[STRSIZE]; 


The first function finds the current length of a string. 


strlen(string) 
char string[]); 


înt i; 
for (i=0; string[i] != '\0'; i++) 


return(i): 
} /* end strlen */ 


The second function accepts two strings as parameters. The function returns an 
integer indicating the starting location of the first occurrence of the second parameter 
string within the first parameter string. If the second string does not exist within the 
first, — 1 is returned. 


int strpos(char s1[], char s2[]) 
{ à 


int Jenl, len2; 
int i, jl, j2; 


len1 = strlen(s1); 

len2 = strlen(s2); 

for (i=0; i+len2 <= lenl; i++) 

for (jl=i, j2=0; j2 <= len? & sl{jl] == s2[j2]; 
jl++, j2++) 
if (j2 == len?) 

return(i); 

return(-1); 

} /* end strpos */ 


Another common operation on strings is congatenation. The result of concatenat- 


ing two strings consists of the characters of the first followed by the characters of the 
second. The following function sets s1 to the concatenation of sl and s2. 
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void strcat(char s1[], char s2{]) 
{ 


int iJ 
for (i=0; sl[i] != '\0'; i++) 


for (j=0; s2[j] != '\0'; silis] = s2[j+]) 


} /* erd strcat */ 


; The last operation we present on strings is the substring operation. substr(s1,i,j.s2) 
sets the sting s2 to the j characters beginning at 51 [i]. 


void suostr(char sl[{], int i, int j, char s2[}) - 
{ 


int k, m; 
, for (k=i, m=0; m< j; S2{m++] = si[k++]) 


; 
s2im] = '\0'; 
} /* end substr */ : ` 


Two-Dimensional Arrays 


The component type of an array can be another array. For example, we may define 


int a[3] (5); 


This defines a new array containing three elements. Each of these elements is 
itself an array containing five integers. Figure 1.2.2 illustrates such an array. An clement ` 
of this array is accessed by specifying two indices: a row number and a column number. 
For example, the element that is darkened in Figure 1.2.2 is in row 1 and column 3 and 
may be referenced as af 1][3]. Such an array is called a two-dimensional array. The 
number of rows or columns is called the range of the dimension. In the array a, the 
range of the first dimension is 3 and the range of the second dimension is 5. Thus array 
a has three rows and five columns. 


Column Column Column Column Column 
0 1 2 3 4 


Row 0 
Row 1 


Row 2 


Figure 1.2.2 Two-dimensional arrays. 
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A two-dimensional array clearly illustrates the differences between a logical and 
a physical view of data. A two-dimensional array is a logical data structure that is useful 
in programming and problem solving. For example, such an array is uscful in describing 
an object that is physically two-dimensional, such as a map or a checkerboard. [tis also 
useful in organizing a set of values that are dependent upon two inputs. For example, 
a program for a department store that has 20 branches, each of which sells 30 items, 
might include a two-dimensional array declared by 


int sales[20] [30]; 


Each clement sales{i||j] represents the amount of item j sold in branch i. 

However, although it is convenient for the programmer to think of the clements 
of such an array as being organized in a two-dimensional table (and programming 
languages do indeed include facilities for treating them as a two-dimensional array), 
the hardware of most computers has no such facilities. An arra; must be stored in the 
memory of a computer, and that memory is usually linear. By this we mean that the 
memory of a computer is essentially a one-dimensional array. A single address (that 
may be viewed as a subscript of a one-dimensiona! array) is used to retrieve a par- 
ticular item from memory. To implement a two-dimensional array, it is necessary to 
develop a method cf ordering its elements in a linear fashion and of transforming a 
two-dimensional reference to the linear representation. 

One method of representing a two-dimensiont array 2 memory is the row-major 
representation. Under this representation, the first row of the array occupies the first set 
of memory locations reserved for the array, the second row occupies the next set. and 
so forth. There may also be several locations at the start of the physical array that serve 
as a header and that contain the upper and lower bounds of the two dimensions. (This 
header shouid not be confused with the headers discussed earlier. This header is for the 
entire array. whereas the headers mentioned earlier are headers for the individual array 
elements.) Figure 1.2.3 illustrates the row-major representation of the two-dimensional 
array a declared above and illustrated in Figure 1.2.2. Alternatively, the header need 
not be contiguous to the array elements but could instead contain the address of the 
first element of the array. Additionally, if the elements of the two-dimensional array are 
variable-length objects. the elements of the contiguous area could themselves contain 
the addresses of those objects in a form similar to those of Figure }.2.1 for linear arrays. 

Let us suppose that a two-dimensional integer array is stored in row-major se- 
quence, as in Figure 1.2.3, and let us suppose that. for an array ar. base(ar) is. the 
address of the first element of the array. That is. if ar is declared by 


int ar[r1] [r2]; 


where rl and 72 are the ranges of the first and second dimension. respectively, base(ar) 
is the address of a7{O][O}. We also assume that esize is the size of each element in the 
array. Let us calculate the address of an arbitrary element, ar[i1][i2]. Since the element 
is in row #1, its address can be calculated by computing the address of the first element 
of row ii and adding the quantity i2 = esize (this quantity represents how far into row j1 
the element at column /2 is). But to reach the first element of row i] (that is. the element 
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Row 0 


Row 1 


Row 2 


Figure 1.2.3 Representing a two- 
dimensionai array. 


. 


ar{il}[0]), it is necessary to pass through il complete rows, each of which contains r2 
elements (since there is one element from each column in each row), so that the address 
of the first element of row i1 is at base(ar) + il * r2 * esize. Therefore the address of 
ar{il)[i2] is at 


base(ar) + (i1 * m + i2) * esize 


As an example, consider the array a of Figure 1.2.2, whose representation is il- 
lustrated in Figure 1.2.3. In this array, rl = 3, r2 = 5, and base(a) is the address of 
a{O}[0]. Let us also suppose that each element of the array requires a single unit of 
storage. so that esize equals 1. (This is not necessarily true, since a was declared as an 
array of integers and an integer may need more than one unit of memory on a particu- 


Jar machine. For simplicity, however, we accept this assumption.) Then the location of 
a{2][4] can be computed by 


base[a] + (2*54+4)*1 
that is, 


base(a) + 14 


on 


Figure 1.2.2, where w1 is 3 and 2 is 5. 
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4 


Figure 1.2.4 Alternative implementation of a two-dimensional array. 


To reference ar{/][j]. the array ar is first accessed to obtain the pointer arli]. The 
array at that pointer location is then accessed to obtain afi |[/]. 

Indeed, this second implementation is the simpler and more straightforward of 
the two. However, the ul arrays ar{O] through ar{ul — 1] would usually be allocated 
contiguously, with ar[0] immediately followed by ar{1]. and so on. The first imple- 
mentation avoids allocating the extra pointer array, ap. and computing the value of an 


explicit pointer to the desired row array. It is therefore more efficient in both space and 
time. 


Multidimensional Afrays 


C also allows arrays with more than two dimensions. For example, a three- 
dimensional array may be declared by 


int b[3][2] [4]; 


and is illustrated in Figure 1.2.5a. An element of this array is specified by three sub- 
scripts, such as b[2][0][3]. The first subscript specifies a plane number. the second sub- 
Script a row number, and the third a column number. Such an array is useful when a 
value is determined by three inputs. For example, an array of temperatures might be 
indexed by latitude, longitude, and altitude. 

For obvious reasons, the geometric analogy breaks down when we go bevond 
three dimensions. However, C does allow an arbitrary number of dimensions. For ex- 
ample, a six-dimensional array may be declared by 


int c [7] [15] [3] [5] [8] [2]; 
Referencing an element of this array would require six subscripts. such as 
[2 ]{3][0][1)[6]{1]. The number of different subscripts that are allowed in a particular 


position (the range of a particular dimension) equals the upper bound of that dimension. 
The number of elements in an array is the product of the ranges of all its dimensions. 
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Figure 1.2.5 Three-dimensional array. 
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For example, the array b contains 3 * 2 * 4 = 24 elements, and the array ¢ contains 
7*15*3%*5%*8*2 = 25,200 elements. 

The row-major representation of arrays can be extended to arrays of more than 
two dimensions. Figure 1.2.5b illustrates the representation of the array b of Figure 


1.2.5a. The elements of the previously described six-dimensiona! array c are ordered 
as follows: 


C(O) [0} (0) [0) £o] [0] 
C(O} [0] [0] [0] (0) (2) 
C{0} £0] (9) [oJ [1] (0) 
C(O) [0] (0) [0] (1) (1) 
C(O} [0] LO] [0] [2] [0] 


C6) [14] (2) [4] [5] [0] 
C{6) [14] [2] [4] (5) [1] 
C{6) [14] [2] [4] £6} [0] 
C{6} (14) (2) [4] (6} [1] 
C{6) [14] [2] [4] [7] (0) 
C(6} [14] [2] £4) [7] (1) 


That is, the last subscript varies most rapidly, and a subscript is not increased until all 
possible combinations of the subscripts to its right have been exhausted. This is similar 
to an odometer (mileage indicator) of a car where the rightmost digit changes most 
rapidly. 

What mechanism is needed to access an element of an arbitrary multidimensional 
array? Suppose that ar is an n-dimensional! array declared by 


int ar(r1)[r2]... [rn]; 
and stored in row-major order. Each element of AR is assumed to occupy esize storage 
locations, and base(ar) is defined as the address of the first element of the array (that 
is, ar[O}[O] ...[0]). Then, to access the element 


ar[il)[i2)...[in]; 


it is first necessary to pass through i} complete “hyper-planes,” each consisting of r2 * 
r3 * ...* rn elements to reach the first element of ar, whose first subscript is i1. Then 
it is necessary to pass through an additional i2 groups of r3* r4 * ...* rn elements to 
reach the first element of ar, whose first two subscripts are il and i2. respectively. A 
similar process must be carried out through the other dimensions until the first element 
whose first n — 1 subscripts match those of the desired element is reached. Finally, it 
is necessary to pass through iv additional elements to reach the element desired. 

Thus the address of ar{i1}{i2] ... [in] may be written as base(ar) + esize * {il * 
r2* ...# n+ i2* 73% 2. en + ...+ (in — 1) rn + in)], which can be evaluated 
more efficiently by using the equivalent formula: 
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base(ar) + esize * 
Lin + rn * (iln - 1) + r(n-1)* (6... 4 3 * (24+ 2* iD) oa) 


This formula may be evaluated by the following algorithm, which computes the address 


of the array element and places it into addr (assuming arrays i and r of size n to hold 
the indices and the ranges, respectively): l 


offset = 0; 
for (j = 0; j< n; j++) 

offset = r[j] * offset + i[j]; 
addr = base(ar) + esize * offset: 


EXERCISES 


1.2.1. (a) The median of an array of numbers is the element m of the array such that half 
the remaining numbers in the array are greater than or equal to m and half are less 
than or equal to m, if the number of elements in the array is odd. If the number of 
elements is even. the median is the average of the two elements m, and m, such 

> that half the remaining elements are greater than or equal to m, and mz, and half 
the elements are less than or equal to m, and m2. Write a C function that accepts 
an array of numbers and returns the median of the numbers in the array. 

(b) The mode of an array of numbers is the number m in the array that is repeated most 
frequently. If more than one number is repeated with equal maximal frequencies, 
there is no mode. Write a C f'~-tion that accepts an array of numbers and returns 
the mode or an indication that the mode does not exist. 


1.2.2. Write a C program to do the following: Read a group of temperature readings. A reading 
consists of two numbers: an integer between —90 and 90, representing the latitude at 
which the reading was taken, and the observed temperature at that latitude. Print a table 
consisting of each latitude and the average temperature at that latitude. If there are no 
readings at a particular latitude, print “NO DATA” instead of an average. Then print the 
average temperature in the northern and southern hemispheres (the northern consists 
of latitudes 1 through 90 and the southern consists of latitudes — 1 through —90). (This 
average temperature should be computed as the average of the averages, not the average 
of the original readings.) Also determine which hemisphere is warmer. In making the 
determination, take the average temperatures in all latitudes of each hemisphere for 
which there are data for both that latitude and the corresponding latitude in the other 
hemisphere. (For example. if there is data for latitude 57 but not for latitude —57, then the 
average temperature for latitude 57 should be ignored in determining which hemisphe:e 
is warmer.) 

1.2.3. Write a program for a chain of 20 department stores, each of which sells 10 different 

items. Every month, each store manager submits a data card for each item consisting of 

a branch number (from | to 20), an item number (from 1 to 10), and a sales figure (less 

than $100,000) representing the amount of sales for that item in that branch. However, 

some managers may not submit cards for some items (for example, not all items are sold 
in all branche’). You are to write a C program to read these data cards and print a table 
with 12 columns. The first column should contain the branch numbers from 1 to 20 and 
the word “TOTAL” in the last line. The next 10 columns should contain the sales figures 


Introduction to Data Structures Chap. 1 


1.2.4. 


1.2.5. 


1.2.6. 


1.2.7. 


for each of the 10 items for each of the branzhes, with the total sales of each item in the 
last line. The last column should contain the total sales of each of the 20 branches for all 
items, with the grand total sales figure for the chain in the lower right-hand corner. Each 
column should have an appropriate heading. If no sales were reported for a particular 


branch and item, assume zero sales. Do not assume that your input is in any particular 
order. 


Show how a checkerboard can be represented by a C array. Show how to represent the 
state of a game of checkers at a particular instant. Write a C function that is input to 
an array representing such a checkerboard and prints all possible moves that black can 
make from that position. 

Write a function printar(a) that accepts an m-by-n array a of integers and prints the 
valves of the array on several pages as follows: Each page is to contain 50 rows and 
20 columns of the array. Along the top of each page, headings “COL 0," "COL 1,” and 
so forth, should be printed and along the left margin of each page, headings “ROW 0.” 
“ROW 1," and so forth, should be printed. The array should be printed by subarrays. For 
example, if a were a 100-by-100 array, the first page contains a[O}{0] through a[49}[ 19], 
the second page contains a[0}{20] through a[49][39], the third page contains a|0J[40] 
through a[49}[59], and so on until the fifth page contains ‘a{0}[80] through a[49}[99], the 
sixth page contains a[50][0] through a[99][ 19], and so on. The entire printout occupies 
ten pages. If the number of rows is not a multiple of 50, or the number of columns is not 
a multiple of 20, the last pages of the printout should contain fewer than 100 numbers. 

Assume that each element of an array a stored in row-major order occupies four units of 
storage. If a is declared by each of the following. and the address of the first element of 
a is 100, find the address of the indicated array element: 


a. int a[100]; address of a0] 

b. int a[200]; address of a[100] 

c. int a[10)[20]; address of a[0][0] 
d. int a[10][20]; ` address of  a[2]{1] 
e. int a[10] [20]; address of  a[5] [1] 
f. int a[10)[20]; address of a[1][10) 
g. int a[10] [20]; address of a[2][10] 
h. int a[10) [20]; address of a(S} [3] 
i. int a[10][20}; address of a[9]{[19]) 


Write a C function listoff that accepts two one-dimensional array parameters of the same 


size: range and sub. range represents the range of an integer array. For example, if the 
elements of range are 


3 5 10 6 3 


range represents an array a declared by 
int a{3) [5] [10] (6) [3]; 


The elements of sub represent subscripts to the foregoing array. If sub[i] does not lie 
between 0 and range[i] — 1. all subscripts from the ith onwards are missing. In the 
foregoing example, if the elements of sub are 


p gii 


n 
‘ae 
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sub represents the one-dimensional array af 1 }{3][1][2]. The function /istoff should print 

the offsets from the base of the array a represented by range of all the elements of a that 

are included in the array (or the offset of the single element if all subscripts are within 
bounds) represented by sub. Assume that the size (esize) of each element of a is 1. In 

the foregoing example, listoff would print the values 4, 5, and 6. 

1.2.8. (a) A lower triangular array a is an n-by-n array in which a[i}{j] = = 0. if i<j. What 
is the maximum number of nonzero elements in such an array? How can these 
elements be stored sequentially in memory? Develop an algorithm for accessing 
aļiJ[j], where i > = j. Define an upper triangular array in an analogous manner 
and do the same for such an array as for the lower triangular array. 

(b) A strictly lower i iangular array a is an n-by-n array in which a{iJ{j] = = 0 if 
i <= j. Answer the questions of part a for such an array. 

(c) Letaand b be two n-by-n lower triangular arrays. Show how an n-by-(n + 1) array 
c can be used to contain the nonzero elements of the two arrays. Which elements 
of c represent the elements aļi J[j] and b[i)LjJ. respectively? 

(d) A tridiagonal array a is an n-by-n array in which afi }[j ] = = 0, if the absolute 
value of i — j is greater than 1. What is the maximum number of nonzero elements 
in such an array? How can these elements be stored sequentially in memory? De- 

, velop an algorithm for accessing afi ]Ų } if the absolute value of i — j is 1 or less. 
Do the same for an array a in which a[i JŲ ] = = 0, if the absolute value of i — j 
is greater than k. 


1.3 STRUCTURES IN C 


In this section we examine the C data structure called a structure. We assume that you 
are familiar with the structure from an introductory course: In this section we review 
some highlights of this data structure and point out some interesting and useful features 
needed for a more general study of data structures. 

A Structure is a group of items in which each item is identified by its own identi- 
fier, each of which is known as a member of the structure. (In many other programming 
languages, a struciure is called a “record” and 4 member is called a “field.” We may 
sometimes use these terms instead of “structure” or “member,” although both terms 
have different meanings in C.) For example, consider the following declaration: 


struct { 
char first[10]; 
char midinit; 
char last(20]; 
-3 sname, ename; 


This declaration creates two structure variables, sname and ename, each of which 
contains three members: first, midinit, and last. Two of the members are character 
strings, and the third is a single character. Alternatively, we can assign a tag to the 
structure and then declare the variables by means of the tag. For example, consider the 
following declaration that accomplishes the same thing as the declaration just given: 


42 
Introduction to Data Structures Chap,1 


struct nametype { 
char first[10); 
char midinit; 
char last(20]; 
}; 


struct nametype sname, ename; 


This definition creates a structure tag nametype containing three members, first, 
midinit, and last. Once a structure tag has been defined, variables sname and ename may 
be declared. For maximum program clarity, it is recommended that a tag be declared 
for each structure and variables then be declared using the tag. 


An alternative to using a structure tag is to use the typedef definition in C. For 
example 


typedef struct { 
char first(10]; 
- char midinit; 
char last[20]; 
} NAMETYPE; 


says that the identifier NAMETYPE is synonymous with the preceding structure speci- 
fier wherever NAMETYPE occurs. We can then declare 


NAMETYPE sname, ename; 


to achieve the declarations of the structure variables sname and ename. Note that struc- 
ture tags are conventionally written in lowercase but typedef specifiers are written in 


uppercase in presenting C programs. typedef is sometimes used to achieve the flavor of 
an ADT specification within a C program. . 


Once a variable has been declared as a structure, each member within that variable 


may be accessed by specifying the variable name and the item’s member identifier 
separated by a period. Thus, the statement 


printf("%s", sname.first); 
can be used to print the first name in the structure sname, and the statement 
ename.midinit = 'm' 
can be used to set the middle initial in the structure ename to the letter m. If a member 
of a structure is an array, a subscript may be used to access a à particular element of the 


array, as in 


for (i=0; i < 20; i++) 
sname. last[i}] = ename.last[i]; 


A member of a structure may be-declared to be another structure. For example. 
given the foregoing definition of namerype and the following definition of addrtype 
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Struct addrtype { 
char straddr[40]; 
char city[10]; . 
char state[3]; /*Allow room for two-character */ 
/* abbreviation and '\0'  */ 
char zip[6]; /*Allow room for five-character */ 
li zipcode and '\0' */ 
}i : 


we may declare a new structure tag nmadrype by 


struct nmadtype { 
struct nametype name; 
struct addrtype address; 
7 


If we declare two variables 
Struct nmadtype nmad1, nmad2; 
the following are valid statements: = 


nmadl.name.midinit = nmad2.name.midinit; 
nmad2.address.city(4] = nmadl.name.first{1]; 
for (i=l; i < 10; i++) 

nmad1.name.first[i] = nmad2.name.first[i]; 


ANSI standard C allows the assignment of structures of the same type. For ex- 
ample. the statement nmad] = nmad2; is valid and equivalent to 


nmadl.name = nmad2.name; 
nmad2.address = nmad2.address; 


These, in turn, are equivalent to 


for (i=0; i < 10; i++) 

nmadl.name.first{i] = nmad2.name.first[i]; 
nmadl.name.midinit = nmad2.name.midinit; 
for (i=0; i < 20; i++) 

nmadl.name.Jast{i] = nmad2.name.last{i]; 

“for (i=0; i < 40; i++) 

nmadl.address.straddr[i] = nmad2.address.straddr[i]; 
for (i=0; i < 10; i++) 

nmadl.address.city[i] = nmad2.address.city[i]; 
for (i=0; i < 2; i++) 

nmadl.address.state[i] = nmad2.address.state[i]; 
for (i=0; i < 5; i++) 

nmadl.address.zip[i] = nmad2.address.zip[i}; 


44 ; Introduction to Data Structures Chap. 1 


The reader is cautioned that many compilers, which are based on the original C lan- 
guage as defined by Kernighan and Ritchie, do not permit structure assignment. Thus 
it would be necessary to explicitly assign each member of one structure to another. In 
the remainder of the text we assume ANSI C compliance. 

Consider another example of the use of structures, in which we define structures 
describing an employee-and a student, respectively: 


Struct date { 
int month; 
int day; 
int year; 
He 
‘struct position { 
char deptno[2]; 
char jobtitle[20]; 
}; : 
struct employee { 
struct nmadtype nameaddr; 
Struct position job; 
float salary; 
int numdep; 
short int hplan; 
struct date datehired; 
Fi 
struct student { 
struct nmadtype nmad; 
float gpindx; 
int credits; 
struct date dateadm; 


k 
Assuming the declarations 


struct employee e; 
struct student s; 


a statement to give a 10 percent raise to an employee whose grade point index as a ` 
student was above 3.0 is the following where strcmp (s, t) returns 0 if strings s and t are 
equal. 


if ((stremp(e.nameaddr.name.first,s.nmad.name.first)==0) & 
(e.nameaddr.name.midinit == s.nmad:name.midinit) && 
(stremp(e.nameaddr.name.last,s.nmad.name. last)==0)) 
if (s.gpindx > 3.0) . 
e.salary *= 1.10; 


This statement first ensures that the erhployee record and the student recorg refer to the 
same person by comparing their names. Note that we cannot simply write 
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if (e.nmaddr.name == s.nmad.name) 


s.e 


since two structures cannot be compared for equality in a single operation in C. 

You may have noticed that we used two different identifiers nameaddr and nmad 
for the name/address members of the employee and student records, respectively. 
It is not necessary to do so and the same identifier can be reused to name. mem- 
bers of different structure types. This does not cause any ambiguity, since a member 
name must always be preceded by an expression identifying a structure of a specific 
type. 


Implementing Structures 


Let us now turn our attention from the application of structures to their implemen- 
tation. Any type in C may be thought of as a pattern or a template. By this we mean that 
a type is a method for interpreting a portion of memory. When a variable is declared 
as being of a certain type, we are saying that the identifier refers to a certain portion 
of memory and that the contents of that memory are to be interpreted according to the 
pattern defined by the type. The type specifies both the amount of memory set aside for 
the variable and the method by which memory is interpreted. . ; 

For example, suppose that under a certain C implementation an integer-is rep- 
resented by four bytes, a float number by eight, and an array of ten characters by ten 
bytes. Then the declarations 


int x; 
float y; 
char z[10]* 


specify that four bytes of memory be set aside for x, eight bytes be set aside for y, and 
ten bytes for z. Once those bytes are set aside for these variables, the names x, v, and 
z will always refer to those locations. When x is referenced, its four bytes will be in- 
terpreted as an integer; when y is’ referenced, its eight bytes will be interpreted as a 
real number; and when z is referenced, its ten bytes will be interpreted as a collection 
of ten characters. The amount of storage set aside for each type and the method by 
which the contents of memory are interpreted as specific types vary from one machine 
and C implementation to another. But within a given C implementation, any type al- 
ways indicates a specific amount of storage and a specific method of interpreting that 
storage. 
„Now suppose that we defined a structure by 


struct structtype { 
int fieldl; 
float fieid2; 
char field3[{10]; 


` 
’ 
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and declared a variable 
struct structtype r; 


Then the amount of memory specified by the structure is the sum of the storage spec- 
- ified by each of its member types. Thus, the space required for the variable r is the 
sum of the space required for an integer (4 bytes), a float number (8 bytes), and an 
array of 10 characters (10 bytes). Therefore, 22 bytes are set aside for r. The first 4 
of these bytes are interpreted as an integer, the next 8 as a float number, and the last 
10 as an array of characters. (This is not always true. On some computers, objects of 
certain types may not begin anywhere in memory but are constrained to start at cer- 
tain “boundaries.” For example, an integer of length 4 bytes may have to start at an 
address divisible by 4, and a real number of length 8 bytes may have to start at an ad- 
dress divisible by 8. Thus, in our example, if the starting address of r is 200, the integer 
occupies bytes 200 through 203, but the real number cannot start at byte 204, since 
that location is not divisible by 8. Thus the real number must start at location 208 and 
the entire record requires 26, rather than 22, bytes. Bytes 204 through 207 are wasted 
space.) ‘ 

For every reference to a member of a structure, an address must be calculated. 
Associated with cach member identifier of a structure is an offset that specifies how far 
beyond the start of the structure the location of that field is. in the foregoing example, the 
offset of field! is 0, the offset of field2 (assuming no boundary restrictions) is 4, and the 
offset of field3 is 12. Associated with each structure variable is a base address, which is 
the location of the start of the memory allocated to that-variable. These associations are 
established by the compiler and are of no concern to the user. To calculate the location of 
a member in a structure, the offset of the member identifier is added to the base address 
of the structure variable. ; 

For example, assume that the base address of r is 200. Then what really happens 
in executing a statement such as 


r.field2 = r.fieldl + 3.7; 


is the following. First, the location of r field1 is determined as the base address of r (200) 
plus the field offset of field1 (0), which yields 200. The 4 bytes at locations 200 through 
203 are interpreted as an integer. This integer is then converted to a float number that 
is then added to the float number 3.7. The result is a float number that takes up 8 bytes. 
The location of rfield2 is then computed as the base address of r (200) plus the field 
offset of field2 (4), or 204. The contents of the 8 bytes 204 through 211 are set to the 
float number computed in evaluating the expression. 

Note that the process of calculating the address of a structure component is very 
similar to that of calculating the address of an array component. In both cases an off- 
set that depends on the component selector (the member identifier or the subscript 
value) is.added to the base address of the compound structure (the structure or the ar- 
ray). In the case of a structure, the offset is associated with the field identifier by the type 
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definition, whereas in the case of an array, the offset is calculated based on the value of 
the subscript. l 

These two types of addressing (structure and array) may be combined. For exam- 
ple, to calculate the address of r field3[4], we first use structure addressing to determine 
the base address of the array rfield3 and then use array addressing to determine the lo- 
cation of the fifth element of that array. The base address of r field3 is given by the base 
address of r (200) plus the offset of field3 (12), which is 212. The address of rfield3(4] 
is then determined as the base address of rfield3 (212) plus 4 (the subscript 4 minus 
the lower array bound 0) times the size of each element of the array (1), which yields 
212 +4» 1, or 216. l 

As an additional example, consider another variable, rr, declared by 


struct structtype rr[20]; 


rr is an example of an array of structures. If the base address of rr is 400, then the 
address, of rr{14].field3[6] may be computed as follows. The size of each component 
of rr is 22, so the location of rr{14] is 400 + 14 * 22, or 708. The base address of 
rr[14].field3 is then 708 + 12, or 720. The address of rr{14] field3[6] is therefore 720 + 
6 * 1. or 726. (Again, this ignores the possibility of boundary restrictions. For example, 
although the type rectype may require only 22 bytes, each rectype may have to start at 
an address divisible by 4, so that 2 bytes are wasted between each element of rr and its 
neighbor: If such is the case, then the size of each element of rr is really 24, so that the 
address of rr{14].field3(6] is actually 754 rather than 726.) 


Unions 


Thus far each structure we have looked at has had fixed members and a single 
format. C also allows another type of structure, the union, which permits a variable to 
be interpreted in several different ways. 

For example, consider an insurance company that offers three kinds of policies: 
life, auto, and home. A policy number identifies each insurance policy, of whatever 
kind. For all three types of insurance, it is necessary to have the policyholder’s name, 
address, the amount of the insurance, and the monthly premium payment. For auto and 
home insurance policies, a deductible amount is needed. For a life insurance policy, 
the insured’s birth date and beneficiary are needed. For an auto insurance policy, a 
license number, state, car model, and year are required. For a homeowner's policy, an 
indication of the age of the house and the presence of any security devices is required. 
A policy structure type for such a company may be defined as a union. We first define 
two auxiliary structures. 


#define LIFE 1 ý 
#define AUTO 2 -> 
#define HOME 3 
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struct addr { 
char street(50]; R 
char city[10]; 
char state[3]; 
char zip[6]; 


struct date { 
int month; 
int day; 
‘int year; 
struct policy { 
int polnumber; 
char name (30); 
struct addr address; 
int amount; 
float premium; 
int kind; /* LIFE, AUTO, or HOME */ 
union { i 
struct { 
char beneficiary[30]; 
struct date birthday; 
} life; 
struct { 
int autodeduct; 
char license[10]; 
char state[3]; 
char model [15]; 
int year; 
} auto; 
struct { 
int homededuct; 
int yearbuilt; 
} home; 
} policyinfo; 


Let us examine the union more closely. The definition consists of two parts: a 
fixed part and a variable part. The fixed part consists of all member declarafions up to 
the keyword union, while the variable part consists of the remainder of the definition. 

Now that we have examined the syntax of a union definition, let ussauitifmne its 
semantics. A variable declared as being of a union type T (for example, struct policy 
p;) always contains all the fixed members of T. Thus, it is always valid to reference 
p.name or p.premium or p.kind. However, the union members contained in the value of 
such a variable depend on what has been stored by the programmer. 

It is the prograrnmer’s responsibility to make sure that the use of a member is 
consistent with what has been placed into that location. It is a good idea to maintain a : 
separate fixed member in a structure containing a union whose value indicates which 
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alternative is currently in use. In the foregoing example, the member kind is used for 
this purpose. If its value is LIFE (1), then the structure holds a life insurance policy; if 
AUTO (2), an auto insurance policy; and if HOME (3), a home insurance policy. Thus 
the programmer would be required to execute code similar to the following to reference 
the union: 


if (p.kind == LIFE) 
printf("\n%s %2d//%2d//%4d", p.policyinfo.life.beneficiary, 
p.policyinfo. life.birthday.month, 
p.policyinfo. life.birthday.day, 
Pp.policyinfo. life. birthday. year); 
else if (p.kind == AUTO) 
printf("\nk%d %s %s %s %d", p.policyinfo.auto.autodeduct, 
p.policyinfo. auto. license, 
p.policyinfo.auto.state, 
P.policyinfo.auto.model, 
Pp.policyinfo.auto. year); 
else if (p.kind == HOME) 
printf("\n%d %d", p.policyinfo.home.homededuct, 
p.policyinfo.home. yearbui]t); 
else 
printf("\nbad type %d in kind", p.kind); 


In the foregoing example, if the value of p.kind is LIFE, p currently contains 
members beneficiary and birthday. It is invalid to reference model or yearbuilt while 
the value of kind is LIFE. Similarly, if the value of kind is AUTO, we may reference 
autodeduct, license, state, model, and year but should not reference any other member. 
However, the C language does not require a fixed member to indicate the current alter- 
native of a union, nor does it enforce using a particular alternative depending on a fixed 
member’s value. 

A union allows a variable to take on several different “types” at different points 
in execution. It also allows an array to contain objects of different types. For example, 
the array a, declared by i 


struct policy a[100]; 


may contain life, auto, and home insurance policies. Suppose that such an array a is 
declared and that it is desired to raise the premiums of all life insurance policies and all 


home insurance policies ‘for homes built before 1950 by 5 percent. This can be done as 
follows: . 


for (i=0; i<100; i++) 
if (a[i].kind == LIFE) 
a[i].premium = 1.05 * a[i].premium; 
else if (ali].kind == HOME & 
ali] .policyinfo.yearbuilt < 1950) 
a[i].premium = 1.05 * ali). premium; 
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Implementation of Unions 


To fully understand the concept of a union, it is necessary to examine its imple- 
mentation. A structure may be regarded as a road map to an area of memory. It defines 


the same area of memory, and it is the responsibility of the programmer to determine 
which road map is in current use. In Practice, the compiler allocates sufficient storage 
to contain the largest member of the union. It is the road map, however, that deter- 


mines how that Storage is to be interpreted. For example, consider the simple union and 
structures ` i 


#define INTEGER 1 
#define REAL 2 


struct stint { 
int f3, f4; 
struct stfloat { 
float f5, f6; 
struct sample { 
int f1; 
float f2; 
int utype; 
union { 
struct stint x; 
struct stfloat y; 
} funion; 


bytes 100 through 103, 104 through 111, and 112 through 115, Tespectively. If the 
value of the member utype is INTEGER (that is, 1), bytes 116 through 119 and 120 
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>. 


can be used by only one of them at a time. The programmer determines which member 
is appropriate. 


Structure Parameters 


In traditional C a structure may not be passed to a function by means of a call by 
value. To pass a structure to a function, we must pass its address to the function and 
refer to the structure by means of a pointer (that is, call by reference). The notation p-> 
x in C is equivalent to the notation (*p).x and is frequently used to reference a member 
of a structure parameter. For example, the following function prints a name in a neat 
format and returns the number of characters printed: 


int writename (struct nametype “name) 
{ 


int count, i 


printf("\n"); 
count = 0; 
"for (i=0; (i < 10) && (name->first[i] != '\0'); i++) { 
printf("%c", name->first[i]); 
count++; ~ 
} /* end for */ 
printf("%e", ' '); 


count++; 

if (name->midinit != ' ') { 
printf("%c%s", name->midinit, ". "); 
count += 3; 


} /* end if */ 
for (i=0; (i < 20) & (name->last[i] != '\0'); i++) í 
printf("%c", name->last[i]); 
count++; 
} /* end for */ 
return(count) ; 
} /* end writename */ 


The following list illustrates the effects of the statement x = writename(&sname) on 
two different values of sname: 


Value of sname.first: “Sara” “Irene” 
xo Value of sname.midinit: ‘M’ n3 
Value of sname. last: “Binder” “LaClaustra” 
Printed output: Sara M. Binder Irene LaClaustra 
~~ Value of x: 14 16 


Similarly, the statement x = writename(&ename) prints the values of ename’s fields 
and assigns the number of characters printed to x. 

The original definition of C (by Kernighan and Ritchie) and many older C com- 
pilers do not allow a structure to be passed as an argument even if its value remains 
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unchanged. ANSI C, in addition to al lowing structure assignment, does allow structures 
to be passed by value and returned, without applying the & operator. This involves 
copying the value of the entire structure when the function is called. Thus if the structure 
is very large it is more efficient to pass the structure by reference (that is, using the & 
operator). In the remainder of the text, we therefore pass all structures by reference. 

We have already seen that a member of a Structure may be an array or another 
Structure. Similarly we may declare an array of structures. For example, if the types 
employee and student are declared as presented earlier, we can declare two arrays of 
employee and student structures as follows: 


Struct employee e[100]; 
Struct student s[100]; 


The salary of the fourteenth employee is referenced by e[ 13).salary, and the last 
name is referenced by e[13].nameaddrname.last. Similarly, the admission year of the 
first student is s[0].dateadm. year. 

As an additional example, we present a function used at the start of a new year 
to give a 10 percent raise to all employees with more than ten years seniority and a 5 
percent raise to all others. First, we must define a new array of structures. 


struct employee empset[100]; 
The procedure now follows: 


#define THISYEAR ... 
void raise (struct employee e[]) 


int i; 


for ‘i=0; i < 100; i++) 
if (e[i].datehired.year < THISYEAR - 10) 
e[i].salary *= 1.10; 
else 3 
e[i].salary *= 1.05; 
} /* end raise */ 


As another example, suppose that we add an additional member, sindex, to the 
definition of the employee structure, This member contains an integer and indicates the 


student index in the array s of the particular employee. Let us declare sindex (within 
the employee record) as follows: 


struct employee { 
Struct nametype nameaddr; 


Struct datehired ...; 
“int sindex; 


p 
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The number of credits earned by employee i when the employee was a student can then 
be referenced by s[e[i].sindex].credits. 

The following function can be used to give a 10 percent raise to all employees 
whose grade point index was above 3.0 as a student and to return the number of such 
employees. Note that we no longer have to compare an employee name with a student 
name to ascertain that their records represent the same person (although these names 
should be equal if they do). Instead the field sindex can be used directly to access the 


appropriate student record for an employee. We assume that the main program contains 
the declaration 


struct employee emp[100]; 
struct student stud[100]; 


int raise2 (struct employee e[], struct student s[]) 
{ 


int i, j, count; 


count = 0; 
for (i=0; i < 100; i++) { 
j = e[i].sindex; 
if (s[j].gpindx > 3.0) { 
count++; 
e[i].satary *= 1.10; 
} /* end if */ 
} /* end for */ 
return(count); 
} /* end raise2 */ 


Very often a large array of structures is used to contain an important data table 
for a particular application. There is generally only one table for each such array of 
structures. The student table s and the employee table e of the previous discussion are 
good examples of such data tables. In such cases, the unique tables are often used as 
Static/external variables rather than as parameters, with a large number of functions ac- 
cessing them. This increases efficiency by eliminating the overhead of parameter pass- 
ing. We could easily rewrite the function raise2 above to access s and e as Static/external 
variables rather than as parameters by-simply changing the function header to 


int raise2() 


The body of the function need not be changed, assuming that the tables s and e are 
declared in the outer program. i 


Representing Other Data Structures 


Throughout the remainder of this text, structures are used to represent more com- 

“plex data structures. Aggregating data into a structure is useful because it enables us 

to group objects within a single entity and to name each of these objects appropriately, 
according to its function. 
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As examples of how structures can be used in this fashion, let us consider the 
problems of representing rational numbers. i 


Rational Numbers 


In Section 1.1 we presented an ADT for rational numbers. Recall that a rational 
number is any number that can be expressed as the quotient of two integers. Thus 
1/2, 3/4, 2/3, and 2 (that is, 2/1) are all rational numbers, whereas sqr(2) and 7 are 
not. A computer usually represents a rational number by its decimal approximation. If 
we instruct the computer to print 1/3, the computer responds with .333333. Although 
this is close enough (the difference between .333333 and one-third is only one three- 
millionth), it is not exact. If we were to ask for the value of 1/3 + 1/3, the result would 
be .666666 (which equals .333333 + .333333), whereas the result of printing 2/3 might 
be .666667. This would mean that the result of the test 1/3 + 1/3 = = 2/3 would be 
false! In most instances, the decimal approximation is good enough, but sometimes it 
is not. It is therefore desirable to implement a representation of rational numbers for 
which exact arithmetic can be performed. 

How can we represent a rational number exactly? Since a rational number consists 


of a numerator and a denominator we can represent a rational number rational using 
structures as follows: 


struct rational { 
int numerator; 
int denominator; 


}i 
An alternative way of declaring this new type is 


typedef struct { 
int numerator; 
int denominator; 
} RATIONAL; 


Under the first technique, a rational r is declared by 
Struct rational r; 


under the second technique by 


RATIONAL r; 


You might think that we are now ready to define rational number arithmetic for 
our new representation, but there is one significant problem. Suppose that we defined 
two rational numbers r1 and r2 and we had given them values. How can we test if the 
two numbers are the same? Perhaps you might want to code f 


if (rl.numerator == r2.numerator ’& r1.denominator == 
r2. denominator) 
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That is, if both numerators and denominators are equal, the two rational numbers are 
equal. However, it is possible for both numerators and denominators to be unequal, yet 
the two rational numbers are the same. For example, the-numbers 1/2 and 2/4 are indeed 
equal, although their numerators (1 and 2) as well as their denominators (2 and 4) are 
unequal. We therefore need a new way of testing equality under our representation. 

Well, why are '1/2 and 2/4 equal? The answer is that they both represent the same 
ratio. One out of two and two out of four are both one-half. To test rational numbers 
for equality, we must first reduce them to lowest terms..Once both numbers have been 
reduced to lowest terms, we can then test for equality by simple comparison of their 
numerators and denominators. : . 

Define a reduced rational number as a rational number for which there is no 
integer that evenly divides both the denominator and the numerator. Thus 1/2, 2/3, and 
10/1 are all reduced, while 4/8, 12/18, and 15/6 are not. In our example, 2/4 reduced to 
lowest terms is 1/2, so the two rational numbers are equal. 

A procedure known as Euclid’s algorithm can be used to reduce any fraction of 


the form numerator/denominator into its lowest terms. This procedure may be outlined 
as follows: ' 


. Let a be the larger of the numerator and denominator and let b be the smaller. 
. Divide b into a, finding a quotient q and a remainder r (that is, a = q *b + r). 
. Seta = bandb = r. 

. Repeat steps 2 and 3 until b is 0. 

. Divide both the numerator and the denominator by the value of a. 


Mb wn = 


As an illustration, let us reduce 1032/1976 to its lowest terms. 


Step 0 numerator = 1032 denominator = 1976 
Step 1 a= 1976 b= 1032 

Step 2 a= 1976 b= 1032 q=1 r=944 
Step 3 a= 1032 b= 944 

Steps 4 and 2 a = 1032 b= 944 q=1 r=88 
Step 3 a=944 b=88 

S@ps4and2 a =944 b=88 q=i0 r=64 
Step 3 a = 88 b = 64 

Steps 4 and 2 a = 88 b= 64 q=1 r= 24 
Step 3 a=64 b=2 d 
Steps 4 and 2 a = 64 b = 24 q=2 r=16 
Step 3 a= 24 b= 16 

“Steps 4 and 2 a= 24 b= 16 q=] r=8 
Step3 N a= 16 b=8 

Steps 4 and 2 a= 16 b=8 q=2 r=0 

` Step 3 a=8 b=0 

Step 5 1032/8 = 129 1976/8 = 247 


Thus 1032/1976 in lowest terms is 129/247. 
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Let us write a function to reduce a rational number (we use the tag method for 
declaring rationals). 


void reduce (struct rational *inrat, struct rational *outrat) 
{ = 


int a, b, rem; 


if (inrat->numerator > inrat->denominator) { 
a = inrat->numerator; 
b = inrat->denominator; 

} /* end if */ 

else { 
a ='inrat->denominator; 
b = inrat->numerator; 

} /* end else */ 

while (b != 0) { 
rem=a%b; 
a= bs 
b = rem; 

} /* end while */ 

Outrat->numerator /= a; 

outrat->denominator /= a; 

} /* end reduce */ 


Using the function reduce, we can write another function equal that determines 
whether or not two rational numbers r1 and r2 are equal. If they are, the function returns 
. TRUE; otherwise, the function returns FALSE. 


#define TRUE 1 
#define FALSE 0 


int equal (struct rational *ratl, struct rational *rat2) 
struct rational rl, r2; 


reduce(ratl, &r1); 
reduce(rat2, &r2); 
if (rl.numerator == r2.numerator & 
rl.denominator == r2.denominator) 
return(TRUE) ; 
return(FALSE); 
} /* end equal */ 


We may now write functions to perform arithmetic on rational numbers. We 
present a function to multiply two rational numbers and leave as an exercise the prob- 
lem of writing similar functions to add, subtract, and divide such numbers. 
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void multiply (struct rational *rl, struct rational *r2, struct rational *r3) 
/* r3 points to the result of multiplying *rl and *r2 */ 
{ 

Struct rational rat3; 

= 

rat3.numerator’ = rl->numerator + r2->numerator; 

rat3.denominator = rl->denominator + r2->denominator; 

reduce(&rat3, r3); 
} /* end multiply */ 


Allocation of Storage and Scope of Variables 


Until now we have been concerned with the declaration of variables. that is, the > 
description of a variable’s type or attribute. Two important questions, however, remain 
to be answered: At what point is a variable associated with actual storage (that is, stor- 
age allocation)? At what point in a program may a particular variable be referenced 
(that is, scope of variables)? f 

In C variables and parameters declared within a function are known as automatic 
variables. Such variables are allocated storage when the function is invoked. When the 
function terminates, storage assigned to those variables is deallocated. Thus automatic 
variables exist only as long as the function is active. Furthermore, automatic variables 
are said to be local to the function. That is, automatic variables are known only within 
the function in which they are declared 2nd may not be referenced by other functions. 

Automatic variables (that is, parameters in a function header or local variables 
immediately following any opening brace) can be declared within any block and remain 
in existence until the block is terminated. The variable can be referenced throughout the 
entire block unless the variable identifier is redeclared within an internal block. Within 
the internal block, a reference to the identifier is to the inner-most declaration, and the 
outer variable cannot be referenced. 

The second class of variables in C are the external variables. Variables that are 
declared outside any function are allocated Storage at the point at which they are first 
encountered and remain in existence for the remainder of the program’s execution. The 
Scope of an external variable lasts from the point at which it is declared until the end 
of its containing source file. Such variables may be referred to by all functions in that 
source file lying beyond their declaration and are therefore said to be global to those 
functions. : 

A special case is when the programmer wishes to define a global variable in one 
source file and to refer to the variable in another source file. Such a variable must be 
explicitly declared to be external. For example, suppose that an integer array containing 
grades is declared in source file 1] and it is desired to refer to that array throughout a 
source file 2. The following declarations would then be necessary: 


file 1 #define MAXSTUDENTS ... 
int grades [MAXSTUDENTS] ; 


end of file 1 
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file 2 extern int grades[]; 


float average() 


} /* end float */ 


float mode() z 
{ 


} /* end mode */ 
end of. file 2 


When file 1 and file 2 are combined into one program, storage for the array grades 
is allocated in file 1 and remains allocated until the end of file. 2. Since grades is an 
external variable, it is global from the point at which it is defined in file 1 to the end 
of file 1 and from the point at which it is declared in file 2 to the end of file 2, Both 
functions average and mode may therefore refer to grades. 

Note that the size of the array is specified only once, at the point at which the 
variable is originally defined. This is because a variable that is explicitly declared to 
be external cannot be redefined, nor can any additional storage be allocated to it. An 
extern declaration merely serves to declare for the remainder of that source file that 
such a variable exists and has been created earlier. 

Occasionally it is desirable to define a variable within a function for which stor- 
age remains allocated throughout the execution of the program. For example, it might 
be useful to maintain a local counter in a function that would indicate the number of 
times the function is invoked. This can be done by including the word static in the 
variable declaration. A static internal variable is local to that function but remains in 
existence throughout the program’s execution rather than being allocated and deallo- 
cated each time the function is invoked. When the function is exited and reentered, a 
Static variable retains its value. Similarly, a static external variable is also allocated 
storage only once, but may be referred to by any function that follows it in the source 
file. 

For purposes of optimization, it might be useful to instruct the compiler to main- 
tain the storage for a particular variable in a high-speed register rather than in ordinary 
memory. Such a variable is known as a register variable and is defined by including 
the word register in the declaration of an automatic variable or in the formal parameter 
of a function. There are many restrictions on register variables that vary from machine 
to machine. The reader is urged to consult the appropriate manuals for details on these 
restrictions. 

Variables may be explicitly initialized as part of a declaration. Such variables 
are conceptually given their initial values prior to execution. Uninitialized external and 
static variables are initialized to 0, whereas uninitialized automatic and register vari- 
ables have undefined values. 

To illustrate these rules consider the following program: (The numbers to the left 
of each line are for reference purposes.) 
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source filel.c 


l int x, y, z; 

2 void funcl() 

3 { 

4 int a, b; 

5 x21; 

6 y=2; 

7 Za 3 

8 a = 1; 

9 b = 2; 
10 printf("%d %d %d %d %d\n", x, Yazi Asbi 
11 } /* end funci */ 
12 void func2() 
Ti i 
14 int a; 


15 a= 5; 
16 printf ("%d %d %d %d\n", x, y, z, a); 
17 } /* end func2 */ 


end of source filel.c 
source file2.c 


18 #include <stdio.h> 
19 #include <filel.c> 


20 extern int x, y, z; 

21 void main() 

22 -f 

23 funcl(); 

24 printf("%d %d %d\n", x, y, z); 
25 func2(); 

26 func3(); 

27 func3(); 

28 func4(); 

29 printf("Xd %d %d\n", x, y, z); 
30 } /* end main */ 


3l void func3() 
{ 


32 

33 static int b; /* b is initialized to 0 */ 
34 y++; 

35 b++; 


36 printf("%d Xd %d %d\n", x, y, z, b); 
37 } /* end func3 */ 
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t 


38 void func4() 
3 4 
40 t tayt 


41 x = 10; 
42 y = 20; 
43 z = 30; 


44 printf("%d %d %d\n", x, y, 2); 
45} /* end func */ 


end of source file2.c 


Execution of the program yields the following results: 


aS e SL’ 

b 123 

c 1235 

d 13 3.1 

e 1432 9 
f 10 20 30 

g 143 


Lei us trace through the program. Execution begins with line 1, in which ex- 
ternal integer variables x, v, and z are defined. Being externally defined, they will be 
known (global) throughout the remainder of file1.c (lines 1 through 17). Execution then 
proceeds to line 20, which’declares by means of the word extern that the external in- 
teger variables x, y, and z are to be associated with the variables of the same name in 
line 1. No new storage is allocated at this point, since storage is allocated only when 
these variables are originally defined (line 1). Being external, x, y, and z will be known 
throughout the remainder of file2.c, with the exception of func4 (lines 38 through 45), 
where the declaration of local automatic 5 sine x; » and z (line 40) supersedes the 
original definition. 

Execution begins with main(), line 21. This immediately invokes func1. funci 
(lines 2 through 4) defines local automatic variables a and b (line 4) and assigns values 
to the global variables (lines 5 through 7) and to its local variables (lines 8 through 9). 
Line 10 therefore produces the first line of output (line a). Upon termination.of funcl 
(line 11) storage for variables a and b is deallocated. Thus, no other function will be 
able to refer to these variables. 

Control is then returned to the main function (line 24). The output is given in line 
b. It then invokes func2. func2 (lines 12 through 17) defines a local automatic variable, 
a, for which storage is allocated (line 14) and a value assigned (line 15). Line 16 refers 
to the external (global) variables x, y, and z previously defined in line 1 and assigned 
values in lines 5 through 7. The output-is‘viven in line 5. Note that it would be illegal 
for func2 to attempt to print a value i b, since this variable no longer exists, being 
allocated only within func. 

The main program then invokes fune3 twice (lines 26 through 27). func3 (lines 31 
through 37), when called for the first time, allocates storage for the static local variable b 
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and initializes it to 0 (line 33). b will be known only to func3; however, it will remain in 
existence for the remainder of the program’s execution, Line 34 increments the global 
variable y, and line 35 increments the local variable’ b. Line d of the Output is then 
printed. The second time func3 is invoked by the main program, new storage for b 
is not allocated: thus, when b is incremnted in line 35 the old value of b (from the 
Previous invocation of func3) is used. T final value of b thus will reflect the number 
of times that func3 was invoked. 

Execution then continues in the main function that invokes func4 (line 28). As 
was mentioned earlier, the definition of internal-automatic integer variables x, y, and z ` 
in line 40 supersedes the definition of x, y. and z in lines 1 and 20, and remains in force 
only within the scope of func4 (lines 38 through 45), Thus the assignment of values 
in lines 41 through 43 and the output (line f) resulting from line 44 refer only to these 
local variables. As soon as func4 terminates (line 45) these variables are destroyed. 
Subsequent references to x, y. and z (line 29) refer to the global x, y, and z (lines | and 
20) producing the output of line g. 


EXERCISES 


1.3.1. Implement complex numbers, as specified in Exercise 1.1.8, using structures with real 
and complex parts. Write routine: to add. multiply, and negate such numbers, 
1.3.2. Suppose that a real number is represented by a C structure such as 


struct realtype { 
int left; 
int right; 


where left and right represent the digits to the left and tight of the decimal point, Tespec- 

tively. If left is a negative integer, the represented real number is negative. 

(a) Write a routine to input a real number and create a structure representing that num- 
ber. 

(b) Write a function that accepts such a structure and returns the real number repre- 

* sented by it. 

(c) Write routines add, subtract, and multiply that accept two such structures and set 
the value of a third structure to represent the number that is the sum, difference, 
and product, respectively, of the two input records. 

1.3.3, Assume that an integer needs four bytes, a real number needs eight bytes, and a char 
needs one byte. Assume the following definitions and declarations: 


struct nametype { 
char first[10]; 
char midinit; 
char last[20]; 
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1.3.4. 


1.3.5. 
1.3.6. 


1.3.7. 


struct person { 
Struct nametype name; 
int birthday[2]; 
struct nametype parents(2]; 
int income; 
int numchildren; 
char address{20]; 
char city[10]; 
char state[2]; 

hi 

Struct person p[100]; 


if the starting address of p is 100, what are the starting addresses (in bytes) of each of 
the following? 


(a) p[10] 

(b) p[20].nane.midinit 

(c) p[20].income 

(a) p[{20}.address[5} 

(e) p{5].parents(1].last[10] 


Assume two arrays, one of student records and the other of employee records. Each 
student record contains members for a last name, a first name, and a grade point index. 
Each employee record contains members for a last name, a first name, and a salary. Both 
arrays are ordered in alphabetical order by last name and first name. Two records with 
the same last name/first name do not appear in the same array. Write a C function to give 
a 10 percent raise to every employee who has a student record and whose grade-point 
index is greater than 3.0. 

Write a function as in the preceding exercise, but assuming that the employee and student 
records are kept in two ordered external files, rather than in two ordered arrays. 

Using the rational number representation given in the text, write routines to add, subtract, 
and divide such numbers. 

The text presents a function equal that determines whether or not two rational numbers 
rl and r2 are equal by first reducing rt and r2 to lowest terms and then testing for 
equality. An alternative method would be to multiply the denominator of each by the 
numerator of the other and test the two products for equality. Write a function egual2 to 
implement this algorithm. Which of the tvo methods is preferable? 


1.4 CLASSES IN C++ 


In this section, we introduce the C++ language and the concept of a C++ class. A 
class embodies the concept of an abstract data type by defining both the set of values of 
a given type and the set of operations that can be performed on those values. A variable 
of a class type is known as an object and the operations on that type are called methods. 
When one object;A invokes a method m on another object B, we sometimes say that “A 
is sending message m to B.” B is viewed as receiving that message and carrying out a 
transformation in response to that message. 
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The Class Rationai 


To illustrate the concept of a C++ class, consider the abstract data type RA- 
. TIONAL that we introduced in Section 1.1. That ADT modeled a rational number as 

consisting of two components, a numerator and a denominator, and defined methods 
to check'if two rational numbers were equal, to add two rationals, to multiply two ra- 
tionals and to-create a rational from two integers. In Section 1.3 we implemented the 
ADT RATIONAL as a C structure and presented C functions to implement its opera- 
tions. We recommend that you reread the portions of Sections 1.1 and 1.3 dealing with 
the definition and implementation of the ADT RATIONAL before proceeding. 

A C++ class builds on the concept of a C structure. Whereas a C structure isa 
collection of named fields, a C++ class is a collection of named fields and methods 
(or functions) that apply to objects of that class type. Additionally, the C+ + language 
implements the concept of information hiding, restricting access to certain members 
of the class to methods of the class itself. 


For example, the C++ definition of a class to implement the ADT RATIONAL 
might he the following: 


class Rational{ 
Jong numerator; = 
long denominator; 
void reduce (void); 
public: ii , 
Rational add(Rational); 
Rational mult(Rational); 
Rational divide(Rational); 
int equal (Rational); 
void print(void); 
void setrational(long, long); 
} 


This class, named Rational, contains two data members, numerator and denom- 
inator, and seven method members, reduce, add, mult, divide, equal, print, and set- 
rational, These seven methods are merely defined here; their actual implementations 
must be provided subsequently. 

The methods add, mult, and equal implement the ADT functions of the same 
name. We have added a method divide to divide one rational number by another. While 
the corresponding ADT functions take two parameters, the methods in the class Ra- 
tional explicitly mention only one. This is because the class object for which they 
are invoked is an implicit parameter for each routine. We will see how this is done 
shortly. 

The method setrational is used to set the value of a Rational to the rational number 
formed by a particular numerator and denominator. Further, the members are divided 
into two groups: numerator, denominator and reduce are private. That is, they can be 


referenced only from within the methods of the class Rational. This could have been 
made explicit by stating: 
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class Rational { 
private: 
Jong numerator; 


but. by default, the members defined at the beginning of a class definition are private 
without the need for explicitly stating this. The members setrational, add, mult, divide, 
equai, and print, by contrast, are public. This means that they can be referenced outside 
the methods of the class Rational. 

The reasons for doing this are simple. We do not want “outsiders” manipulating 
either the rational or denominator members. They are merely a way of implementing 
a rational number and are to be used solely for that purpose. An externa! function ma- 
nipulates a Rational; only within the internal methods of Rational should we be able 
to access numerator and denominator. Similarly, the method reduce is a function to 
reduce the internal representation of the Rational (that is, the numerator and denomi- 
nator) to lowest terms. We intend to use reduce to ensure thai every rational number is ' 
kept in lowest terms. The outside worid has no cause to call reduce. Every method that 
manipulates the internal numerator and denominator (that is, setrational, add, mult, and 
divide) is a member of the class Rational and wiil automatically ensure that the result- 
ing number is in reduced form by calling reduce. We will see this when we present the 
implementations of these methods. There is no need for anyone else to call reduce, and 
therefore reduce is defined as private. ‘ 

On the other hand. the methods setrationai, add, mult, equal, and print are public. 
These functions form the public interface for the class Rational. That is, they are the 
methods by which the outside world can manipulate and use objects of type Rational. 


Using the Class Rational 


We now present an example of the use of the class Rational. Suppose that the class 
Rational, together with the implementations of its methods, were defined in a header 


file rational.h. Then, suppose that we wanted to write a program to do the following: 
Input lines of the form 


op ra rb; 

where op is the character + or *, and ra and rb are either integers or of the form 
a / b 

where a and b are integers. For example, the following are all valid input lines: 
eee as 

This asks to add the integers 3 and 7 to produce 10. 


ee a 
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This asks to add the rational 3/4 and the integer 5 to produce the rational 23/4. 
* 34 f B 

This asks to multiply the integer 3 and the rational 4/8 to produce the rational 3/2. 
+ 3 f/ 4S ff 6 


This asks to add the rationals 3/4 and 5/6 to produce the rational 19/12, The program 
should read the line, perform the indicated computation, and print out the resulting 
rational. 

To assist with the I/O, we assume three routines: int readtoken(char * *), long 
atol(char *), and void error(char *). The function readtoken reads the next operator or 
integer in character form (for example, “/’ or “389"), allocates Storage for the string 
using the stdlib function calloc, and sets a pointer of type char * to it. Should the end 
of file be encountered, readtoken returns a value of EOF; otherwise, it returns 'EOF. 
The sidlib function atol converts a numerical string to an integer, The function void 


error(char *) prints its parameter as an error message and halts execution. The program 
is as follows: ` 


#include "rational.h" 
#include <iostream.h> 
#include <string.h> 
#include <stdlib.h> 


void main() 

{ 
int readtoken(char * *); 
void error(char +); 


char *optr, *tokenl, *token2, *token3; 
int intl, int2; 
Rational opndl, opnd2, result; 


while (readtoxen(&optr) != EOF) {  // read the operator 


readtoken(&token1) ; // read the first integer's 
// character string 
inti = atol(tokenl); // convert the first token 


// to an integer 
readtoken(&token2); 
if (stremp(token2, "/") != 0 
// convert the integer operand to a Rational 
opndl.setrational(intl, 1); 
else { l ; 
// get the denominator of the Rational operand 
readtoken(&token3); 
int2 = ato?(token3); 
// convert the numerator and deneminator to a Rational 
opndl.setrational(intl, int2); 
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readtoken(&toxken2); 
} /* ent if */ 
// get the second operand 
intl = atol(token2); 
readtoken(&token2); 
if (strcmp(token2, "/") != 0) 
// convert the integer operand to a Rational 
opnd2.setrational(intl, 1); 
else { 
// get the denominator of the Rational operand 
readtoken(&token3) ; 
int2 = atol(token3); 
// convert the numerator and denominator to a Rational 
opnd2.setrational(intl, int2); 
readtoken(&token2); 
} /* end if */ 
if (stremp(token2, ";") != 0) 
error ("ERROR: ; expected, not found.”); 
// apply the operator to the Rational operands 
if (*optr == '+') 
result = opndl.add(opnd2) ; 
else if (*optr == '*') 
result = opndl.mult(opnd2) ; 
else 
error("ERROR: illegal operator; must be * or + "); 
resuit.print(); 
} /* end while */ i | 
} /* end main */ p 


In the declarations, the variables opnd}, opnd2. and result are declared as of type 
Rational. This makes them objects of that class. That is, each one contains a numerator 
and denominator, and can be used to call the methods of class Ratignal; including add, 
mult, setrational, and print. 

Note how the methods of Rational are called. If opàùdl and opnd2 are Rationals, 
then the call opnd1.add(opnd2) adds the two rational numbers represented by opnd] 
and opnd2 and produces a Rational representing the result. We say that the program 
“sends the message” add tg epnd 1. Similarly, the call result.print() sends the message 


print to result and the call opnd1 .setrational(intl, int2) sends the message setrational 
to opnal. 


implementing the Methods á 


The methods of a class can be implemented within the declaration of the class or 
outside it. For example, the method setrational, which sets an object of type Rationai 
to a particular value, can be implemented within the class declaration as foliows: 


class Rational 
long numerator; 
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public: 
Rational add(Rational); 


void print(void); 
void setrational (long n, long d) 


if (d == 0) 

error ("ERROR: denominator may not be zero"); 
numerator = n; 
denominator = d; 


reduce(); // reduce to lowest terms 
} /* end setrational */ 
} /* end Rational */ 


The body of the function setrational appears within the declaration of Ratio- 
nal. Note that serrational references the members numerator and denominator. Private 
members of a class, such as numerator and denominator, can be referenced directly, 
with no needed qualification, from a method of that class, but they cannot ordinarily be 
referenced by a function outside the Class. Similarly, the method reduce is called within 
the method serrational with no qualification. This refers to the method reduce defined 
within the method Rational. 

When setrational is called, as in the statement opnd1.setrationaltintl, int2);, re.- 
erences to numerator and denominator within setrational will refer to opnd\.numerator 
and opnd1.denominator, and the call reduce() is to opndl.reduce(). Alternatively, set- 
rational can be defined outside the declaration for Rational. Rational must still contain 
the header of the function 


void setrational(long, long); 


but it need not contain its body. The body can be provided after the decfiration for 
Rational is completed, as follows: 


void Rational: :setrational (long n, long d) 
{ 
if’ (d <= 0) 

error("ERROR: denominator may not be zero"); 
numerator = n; 
denominator = d; 


ʻ*  reduce(); // reduce to lowest terms 
} /* end setrational */ 
Note the header line 


void Rational: :setrational(long n, long d) 


which specifies that we are defining one of the methods within the type*Rarional. The 
notation “Rational::" introduces a scope and specifies that the function serrational be- 
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ing defined is a method of class Rational. Once this scope has been opened, We can 
utilize the private members numerator and denominator and the private method reduce. 

Hete are the specifications of the remaining methods of RATIONAL. The routine 
for reduce follows closely the algorithm of Section 1.3. We first make sure that mmer- 
ator and denominator are both positive, keeping track of the sign. 


void Rational::reduce (void) 
{ 


int a, b, rem, sign; 


if (numerator == 0) 
denominator = 1; 
sign = 1; //assume positive 
/{ check if any negatives 
if (numerator < 0 && denominator < 0) { 
numerator = -numerator; 
denominator = -denominator; 
} 
if (numerator < 0) { 
numerator = -numerator; 
sign = -1; 
} i 
if (denominator < 0) { 
denominator = -denominator; 
sign = - 
} 
if (numerator > denominator) { 
a = numerator; 
b = denominator; 


} 
else { 
a = denominator; 
b = numerator; 
3 ’ 
$ 
while (b != 0) { 
rem = a % b; 
a= bD; 
b = rem; 


y 


I 
numerator = Sign * numerator / a; 
denominator = denominator / a; 

} /* end reduce */ 


To add two rational numbers, we could first reduce each to Jowest term, then 
multiply the two denominators to produce a resulting denominator, then multiply each 
numerator by the denominator of the other rational numbers and add the two products 
to produce the numerator. The result can then be reduced to lowest terms. However, this 
provides the danger that the product of the two denominators may be too large even for 
a long variable. 
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AHStead, We usc the loliowing algorithm to add a/b to c/d. We assume that the 
value rden(x, y) denotes the denominator of x/y reduced to lowest terms: 


k = rden(b, d); 
denom = b*k; // the resulting denominator 
num = a*k + c*(denom/d); // the resulting numerator 


num is the numerator of the sum, denom is the denominator, We leave it as an exercise 
to show that this algorithm is- correct. 

Implementing this algorithm in the context of the class Rational provides the fol- 
lowing definition of the method add: 


Rational Rational::add(Rational r) 
{ 

int k, denom, num; 

„Rational rnl; 


// first reduce both rationals to lowest terms 
reduce(); = 
r.reduce(); 


// implement the line k = rden(b, d); of the algorithm 
rnl.setrational (denominator, r. denominator); 

rn}. reduce(); 

k = rnl.denominator; 


// compute the denominator of the result 
// algorithm line denom = b*k; 
denom = denominator * k; 


// compute the numerator of the result 
// algorithm line num = a*k + c*a(denom/d); 
num = numerator*k + r.numerator*(denom/r.denominator); 


// form a Rational from the result and reduce 
// the result to lowest terms 
rnl.setrational(num, denom); 

rni.reduce(); 


return rol; 
} /* end add */ 


In multiplying two reduced rationals a/b and c/d. the straightforward method is to 
compute (a*c)/(b»d). However, here again we want to avoid the multiplication of a*c 
and bd if at all possible. since their intermediate results may be too large. The solution 
is first to reduce a/b and c/d to lowest terms, then to reduce a/d and c/b. In that way we 


are certain that ae has no terms in common with bed and that the products are as small 
as possible. 
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Here is the method mult implementing these ideas: 
Rational Rational: :mult(Rational r) 


Rational rnl, rnll, rnl2; 
int num, denom; 


// reduce both inputs to lowest terms 
reduce(); 
r.reduce(); 


// switch numerators and denominators and reduce 
rnll.setrational (numerator, r.denominator); 
yeni. reduce() ; 

rni2.setrational(r.numerator , denominator); 
rni2.reduce(): 


// compute result 
num = rnil.numerator * rni2.numerator; 
denom = rnil.denominator * rn12.denominator; 
rni.setrational(num, denom) ; 
return rnl; 

} /* end mult */ 


The method divide simply multiplies by a reciprocal. 


Rational Rational; :divide(Rational r) 
{ ` 


Rational rnal; 


// Compute the reciprocal of r 
rnl.setrational(r.denominator, r.numerator); 


// Multiply by the reciprocal 
return mult(rnl); 


} ° 


. The method equal reduces both rationals to lowest terms and then checks for 
equality of both numerators and denominators. i 


int Rational: :equal(Rational r) 


reduce(); 

> r.reduce(); 

if (numerator == r.numerator & 

denominator == r.denorinator) 
return TRUE; 

else ‘ 
return FALSE; 
} /* end equal */ 
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To implement the method print we first must decide on a format for the out; ut. A 
reasonable format might be to print the numerator followed by a slash followed by the 
denominator. We adopt this fo mat in the routine below: 


void Rational: :print(void) 


Cout << numerator << "/" << denominator << end]; 
} /* end print */ 


This utilizes the C+ + I/O facilities of the header file iostream.h. cout is an output 
stream and the operator << is used to send data values to the output. 


Overioading 


While we have a routine for adding two rational numbers, we cannot yet add a 
rational v and an integer į without first converting the integer to a rational using the call 
Setrationali, 1). 

Fortunately, C++ allows function names to be overloaded. That is. the same 
function name can apply to different functions if their parameters are of different types. 


Specifically, we can define another method add in the class Rational by including the 
line 


Rational add(long) ; 


in the public section. There are now two methods named add: one appliea to Rationals 
and one applied to integers. Implementation of the new method is straightforward: 


Rational Rational: :add(long i) 
{ 


Rational r; 


r.setrational(i, 1); 
return add(r); 


The implementation is as follows: First. form a new rational! out of the integer using 
Setrational, then call the existing add routine on rationals to add r to the rational number 
of the current object. 

Jf i is an integer, the call rradd(i) is a call to the second add method to add an 
integer to the rational rr. If ris a rational, the call rzadd(r) is a cal! to the original add 
method to add a rational to rr. 


Inheritance 


However, this technique for adding an integer is not entirely satisfactory, Un- 
der the method we have presented, the rational number of the current object is the first 
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‘operand, so we can implement the concept r+é, However, we cannot implement i+r 
equally directly. For addition, this is nota real problem since r+i equals i+r because 
of the commutative law. But consider the case of division, which is not commutative. 
We can write a method Rational divide(int) to compute 7/i, but how do we compute i/r? 

Of course, we could write a separate routine that is not part of a class. with two 
parameters. as follows: 


ational divide(long i, Rational r) 


{ 
Rational rr; 
rr.setrational(i, 1); 
return rr.divide(r); 
} 


However. this makes the division operator nonsymmetrical and breaks the concept of 
class operations. 

Instead, we can note that integers are a form of rationals. We can therefore rep- 
resent every integer by a rational. We do this by defining a new class /nteger which 
inherits the members of the class Rarional. In this new class, we waat to make sure 
that the denominator is always 1, so we redefine the method setrational. In fact. we 
define two versions of setrational. 

Here is the definition of the new class: 


class Integer:public Rational { 
public: f 
void setrational(long, long); 
void setrational (long); 

hi 
The class Rational is called the base class of the class Integer. 

However, in order for sefrational to access the Rational members numerator and 
denominator, those members cannot have been defined as private. A private member 
can only be accessed by the methods of the class itself. not even by the methods of 
an inherited class. To allow Rational to serve as a base class for Jnteger and give In- 
teger access to its members, yet to keep those members inaccessible from the rest of 
the program, the members must be defined as protected rather than private. The new 
definition of Rational would then be 


class Rational { ° 
protected: 
long numerator; 
long denominator; 
void reduce{void) ; 
public: 
}: 


ay 
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Let us now implement the two methods named setrational in the definition of 
Integer. The first setrational is included to override the routine setrational in the class 


Rational so that a noninteger rational is not accidentally assigned to an object of type 
Integer. It is implemented as follows: 


void Integer: :setrational (long num, long denom) 


{ 
if(denom != 1) 
error("ERROR: non-integer assigned to Integer variable"); 
numerator = num; 
denominator = 1; 
} 


The more usual version of Integer::setrational, with one parameter, is as follows: 


void Integer: :setrational(long num) 
i? 

numerator = num; 

denominator = 1; 


Now if ris a Rational and iis an Integer, then any of the following calls are valid: 


r.add(i) 
j.add(r) 
r.divide(i) 
j.divide(r) 


The methods add and divide. defined for Rational, are inherited by Integer and 
can be invoked for an /nteger variable. 


Note that the definition of the class Integer begins with the line 
class Integer:public Rational { 


The indication public in this line specifies that Integer has access to the protected and 
public members of Rational, and they become, in turn, protected and public members of 
Integer, However, if Rational were a protected base class (that is, protected appeared 
instead of public in the opening line of the /nteger definition), the public members 
of Rational would become protected members of Integer. Similarly, if Rational were a 


private base class, the public and protected members of Rational would become private 
members of Integer. 


Constructors 


A constructor is a special method of a class that is invoked whenever an object 
of that class is created. A constructor always is named with the same name as the class 
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itself. In our example above, we used the method sefrational to initialize a Rational 
object. We could have used a constructor instead. ; 

; For example, suppose that we include in the class definition of Rational the fol- 
lowing three members, all public and all named Rational. > 


Rational (void); 
Rational (long) ; 
Rational(long, long); 


They are implemented as follows: 


Rational: :Rational (void) 


{ 
// assume the rational number is 0 
numerator = 0; 
denominator = 1; 
} g 
Rational: :Rational (long i) 
{ 
numerator = 1; 
denominator = 1; 
} / 
Rational: :Rational (long num, long denom) 
{ 
numerator = num; 
denominator = denom; 
} 


Then when we declare an object to be a Rational, the appropriate constructor is 
invoked, The declaration 


Rational r; 


automatically initializes r to the rational zero (O/1) since that is what the constructor 
Rational does with no parameters. The declaration 


Rational r(3); 


sets r to the rational 3/1, since it invokes the second version of the constructor, Finally, 
the declaration 


Rational r(2,5); 
sets r to the rational 2/5, invoking the third version of Rational, with two parameters. 
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The operator new in C++, applied to a type designator, allocates a new object of 


the given type and returns a pointer to it. When new is called. the constructor is also 
invoked automatically. Thus the statement 


Rational *p = new Rational; - 


declares a pointer variable p. allocates a new object of type Rational, initializes it to 0: 


(since that is what the constructor with no arguments does), and sets p to point to the 
object. 


The statement 


Rational *p = new Rational(2,5); 


Sets p to point to a newly allocated Rational object with value 2/5, 


We can also use the constructor in a statement such as 


r = Rational(7); 


which sets r to the rational number 7/1. 


EXERCISES 

1.4.1. Write a method negare for the class Rational that returns the negative of a rational num- 
ber. 

1.4.2. Write a method subtract for the class Rational that returns the result of subtracting one 

rational number from another. i 

1.4.3. Define a class String that represents a string by a length and a pointer to a siring of 

characters, * 

(a) Write a constructor for Siring to allocate appropriate storage for it and to initialize 
it to a given C string. To allocate storage for an array of characters of length N. use 
the C++ operation 

new char[N] 

(b) Write a constructor for String to allocate Storage of a given size for the string but 
not lo initialize its characters. 

(¢) Write a method concar to concatenate one String with another. 

1.4.4. Rewrite the routines of this section to use the constructors Rational rather than the 
method setrational. 
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The Stack 


One of the most useful concepts in computer science is that of the stack, In this chapter 
we shall examine this deceptively simple data structure and see why it plays such a 
preminent role in the areas of programming and programming languages. We, shall 
define the abstract concept of a stack and show how that concept can be made into a 
concrete and valuable tool in problem solving. 


/ 


.1 DEFINITION AND EXAMPLES 


A stack is an ordered collection of items into which new items may be inserted and 
from which items may be deleted at one end, called the top of the stack. We can picture 
a stack as in Figure 2.1.1. ; 

Unlike that of the array, the definition of the stack provides for the insertion and 
deletion of items, so that a’stack is a dynamic, constantly changing object. The question 
therefore arises, how ‘does a stack change? The definition specifies that a single end 
of the stack is designated as the stack top. New items may be put on top of the stack 
(in which case the top of the stack moves upward to correspond to the new highest 
element), or items which are at the top of the stack may be removed (in which case the 
top of the stack moves downward to correspond to the new hi ghest element). To answer 

- the question, which way is up? we must decide which end of the stack is designated 
as its top—that is, at which end items are added or deleted. By drawing Figure 2.1.1 
so that F is physically higher on the page than all the other items in the stack, we 
imply that F is the current top element of the stack. If any new items are added to the 
stack, they are-placed on top of F, and if any items are deleted, F isthe first to be deleted: 


77 


Figure 2.1.1 Stack containing stack 
terms. 


i i + ; 


This is also indicated by the vertical lines that extend past the items of the stack in the 
direction of the stack top. 

Figure 2.1.2 is a motion picture of a stack as it expands and shrinks with the 
passage of time. Figure 2.1.2 a shows the stack as it exists at the time of the snapshot of 
Figure 2.1.1. In Figure 2.1.2 b, item G is added to the stack. According to the definition, 
there is only one place on the stack where it can be placed—on the top. The top element 
on the stack is now G. As the motion picture progresses through frames c, d, and e, items 

" 1 and J are successively added onto the stack. Notice that the last item inserted (in 
uus ease J) i» at the top of the stack. Beginning with frame f, however, the stack begins 
to shrink, as first J, then /, H, G, and F are successively removed. At each point, the 
top element is removed, since a deletion can be made only from the top. Item G could 
not be removed from the stack before items J, J, and H were gone. This illustrates the 
most important attribute of a stack, that the last element inserted into a stack is the first 
element deleted. Thus J is deleted before / because J was inserted after Z. For this reason 
a stack is sometimes called a last-in, first-out (or LIFO) list. 

Between frames j and k the stack has stopped shrinking and begins to expand 
again as item K is added. However, this expansion is short-lived, as the stack then 
shrinks to only three items in frame n. 

Note that there is no way to distinguish between frame a and frame i by looking 
at the stack’s state at the two instances. In both cases the stack contains the identical 
items in the same order and has the same stack top. No record is kept on the stack of 
the fact that four items had been pushed and popped in the meantime. Similarly, there 
is no way to distinguish between frames d and f, or jand I. If a record is needed of the 
intermediate items having been on the stack, that record must be kept elsewhere; it does 
not exist within the stack itself. ‘ 

In fact, we have actually taken an extended view of what is really observed in a 
stack. The true picture of a stack is given by a view from the top looking down, rather 
than from a side looking in. Thus, in Figure 2.1.2, there is no perceptible difference 
between frames h and o. In each case the. element at the top is G. Although the stack 
at frame h and the stack at frame o are not equal, the only way to determine this is 
to remove all the elements on both stacks and compare them individually. Although 
we have been'looking at cross sections of stacks to make our understanding clearer, it 


should be noted that this is an added liberty, and there is no real provision for taking 
such a picture. 
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Figure 2.1.2 Motion picture of a stack. 
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Primitive Operations 


The two changes which can be made to a stack are given special names. When 
an item is added to a stack, it is pushed onto the stack, and when an item is removed, it 
is popped from the stack. Given a stack s, and an item i, performing the operation push 
(s, i) adds the item i to the top of stack s. Similarly, the operation pop(s) removes the 
top element and returns it as a function value. Thus the assignment operation 


= pop(s); 


removes the element at the top of s and assigns its value to i. 
For example, if s is the stack of Figure 2.1.2, we performed the operation push 


(s, G) in going from frame a to frame b. We then performed, in turn, the following 
operations: 


push (s,H); (frame (c)) 
push (s,J); (frame (d)) 
push (s,J); (frame (e)) 


pop (s); (frame (f)) 
pop (s); (frame (g)) 
pop (s); (frame (h)) 
pop (s); (frame (1)) 
pop (s); (frame (j)) 
push (s,K); (frame (k)) 
pop (s); (frame (1)). 
pop (s); (frame (m)) 
pop (s); (frame (n)) 


push (s,0); (frame (0)) 


Because of the push operation which addy elements to a stack, a stack is some- 
times called a pushdown list. 

There is no upper limit on the number of items that may be kept in a stack, since 
the definition does not specify how many items are allowed in the coliection. Pushing 
another item onto a stack merely produces a larger collection of items. However, if a 
stack contains a single item and the stack is popped, the resulting stack contains no 
items and is called the empty stack. Although the push operation is applicable to any 
stack, the pop operation cannot be applied to the empty stack because such a stack has 
no elements to pop. Therefore, before applying the pop operator to a stack, we must 
ensyre that the stack is not empty. The operation empry(s) determines whether or not a 
stack s is empty. If the stack is empty, empty (s) returns the value TRUE; otherwise it 
returns the value FALSE. 

Another operation that can be performed on a stack is to determine what the top 
item on a stack is without removing it. This operation is written stacktop(s) and returns 


_ the top element of stack s. The operation stacktop(s) is not really a new operation, since 


it'can be decomposed into a pop and a push. 


= stacktop(s); 
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is equivalent to 


i = pop(s); 
push (s,i); 


Like the operation pop, stacktop is not defined for an empty stack. The result of an illegal 
attempt to pop or-access an item from an empty stack is calle underflow. Underflow 
can be avoided by ensuring that empty(s) is false before attempting the operation pop(s) 
or stacktop(s). “8 


Example 


Now that we have defined a stack and have indicated the operations that can be 
performed on it, let us see how we may use the stack in problem solving. Consider a 
mathematical expression that includes several sets of nested parentheses: for example, 


7-(XK* (X+N/ -3)) +N / (4 - 2.5)) 


We want to ensure that the parentheses are nested correctly; that is, we want to check | 
that 


1. There are an equal number of right and left parentheses. 
2. Every right parenthesis is preceded by a matching left parenthesis, 


Expressions such as 
((A + B) or A+ B( 


violate condition 1, and expressions such as 


)A + B(-C or (A+ B))-(C+D 
violate condition 2. 

To solve this problem, think of each left parenthesis as opening a scope and each 
right parenthesis as closing a scope. The nesting depth at a Particular point in an ex- 
pression is the number of scopes that have been opened but not yet closed at that point. 
This is the same as the number of left Parentheses encountered whose matching right 
parentheses have not yet been encountered. Let us define the parenthesis count at a 
particular point in an expression as the number of left parentheses minus the number 
of right parentheses that have been encountered in scanning the expression from its left 
end up to that particular point. If the parenthesis count is nonnegative, it is the same as 


the nesting depth. The two conditions that must hold if the parentheses in an expression 
form an admissible pattern are as follows: 


1. The parenthesis count at the end of the expression is Q. This implies that no scopes 
have been left open or that exactly as many right parentheses as left parentheses 
have been found. f P 

2. The parenthesis count at each point in the expression is nonnegative. This implies 
that no right parenthesis is encountered for which a matching left parenthesis had 
not previously been encountered. 
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Figure 2.1.3 Parenthesis count at various points of strings. 


In Figure 2.1.3 the count at each point in each of the previous five strings is given 
directly bélow that point. Since only the first string meets the foregoing two conditions, 
it is the only one among the five with a correct parentheses pattern. ; 

Let us now change the problem slightly and assume that three different types 
of scope delimiters exist. These types are indicated by parentheses ((and)), brackets 


([and]), and braces ({ and }). A Scope ender must be of the same type as its scope opener. 
Thus, strings such as 


(A + B], [(A + B)), {A — (B}} 

are illegal. ` 
It is necessary to keep track of not only how many scopes have been opened but 
also of their types. This information is needed because when a scope ender is encoun- 
tered, we must know the symbol with which the scope was opened to ensure that it is 
being closed properly. 

A stack may be used to keep track of the types of scopes encountered. Whenever 
a scope opener is encountered, it is pushed onto the stack. Whenever a scope ender is 
encountered, the stack is examined. If the stack is empty, the scope ender does not have 
a matching opener and the string is therefore invalid. If, however, the stack is nonempty, 
we pop the stack and check whether the popped item corresponds to the scope ender. 
If a match occurs, we continue. If it does not, the string is invalid. When the end of the 
string is reached, the stack. must be empty; otherwise one or more scopes have been 
opened which have not been closed, and the string is invalid. The algorithm for this 
Procedure follows. Figure 2.1.4 shows the state of the stack after reading in parts of the 
string {x +(y— [a + b])*c-[d+e)} Mh-(G-k- {! — n)))). 


valid = true; /* assume the string is valid */ 
s = the empty stack; 
while (we have not read the entire string) { 
read the next symbol (symb) of the string; 
if (symb == '(' || symb == '[' || symb == '{') 
push(s, symb) ; 
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Oss {x+(... (x+(y-[... 


d 


(x+ {y=a+b]... © {x> (y #[a +b)... (x#(y-[a+b})ee-[(... 


, [x+(y-la+b})*c-[(d+e)])... (x + (y-[a+b])*e—[(d + 6) ])}Ah-(j-(k-[.-. 


(x+ (y-[a+b])*e-[(d +0) ))(h-U-(k-[I-n}))... (e+ (y-la +b]. c-i (d-+0)1}Ah=(- w- l-a} 


Figure 2.1.4 Parenthesis stack at various stages of processing. 
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if (symb == ')' || symb == ')' || symb == '{') 
if (empty(s)) 
valid = false; 
else { 
= pop(s); 
if (i is not the matching opener of symb) 
valid = false; 
} /* end else */ 
} /* end while */ 
if (!empty(s)) 
valid = false; 


if (valid) 


printf("%s", "the string is valid"); 
else 


printf("%s", “the string is invalid"); 


Let us see why the solution to this problem calls for the use of a stack. The last 
scope tobe opened must be the first to be closed. This is simulated by a stack in which 
the last element arriving is the first to leave. Each item on the stack represents a scope 
that has been opened but that has not yet been closed. Pushing-an item onto the stack 
corresponds to opening a scope, and popping an item from the stack corresponds to 
closing a scope, leaving one less scope open. 

Notice the correspondence between the number of elements on the stack in this 
example and the parenthesis count in the previous example. When the stack is empty 
(parenthesis count equals 0) and a scope ender is encountered, an attempt is being made 
to close a scope which has never been opened, so that the parenthesis pattern is invalid. 
In the first example, this is indicated by a negative parenthesis count, and in the second 
example by an inability to pop the stack. The reason that a simple parenthesis count 
is inadequate for the second example is that we must keep track of the actual scope 
openers themselves. This can be done by the use of a stack. Notice also that at any 
point, we examine only the element at the top of the stack. The particular configuration 
of parentheses below the top element is irrelevant while examining this top element. It 
is only after the top element has been popped that we concern ourselves with subsequent 
elements in a stack. 

In general a stack can be used in any situation that calls for a ‘iegt-in, first-out 
discipline or that displays a nesting pattern. We shall see more examples of the use of 
stacks in the remaining sections of this chapter and, indeed, throughout the text. 


+The Stack as an Abstract Data Type 
The representation of a stack as:an abstract data type is straightforward. We use 
eltype to denote the type of the Stack element and parameterize the stack type with 
eltype. 


abstract typedef <<e]type>> STACK (eltype); 
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abstract empty(s) 


J3 x! 
STACK(eltype) s; art i r í 
postcondition empty == (len(s), == 0); TEIS 
abstract eltype pop(s) i a aii à Š eh i 
STACK(eltype). S; sur E e hake. a 
precondition empty(s) == FALSE; ae Maps SE, ICN 
postcondition pop == first(s'); Le ALS 

s == sub(s', 1, len(s') - 1); ke an 


abstract push(s, elt) 
STACK(el type) s; 
eitype elt; 
postcondition S$ == <elt>'+ s's) 0 i 
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41.1. 


EEA 


2.1.3. 


2.1.4. 


Use the operations push. pop. stacktop. and, empty to construct operations which do each 
of the following. 


(a) Seti to the second element from the top of the stack, leaving the stack without its 
top two elements. 

(b) Set i to the second element from the top of the stack. leaving the'Stack unchanged. 

(c) Given an integer n, set i to the th element from the top of the stack, leaving: the 
stack without its top n elements. : 

(d) Given an integer n. set / to the nth element from the top of the sack teat the 
stack unchanged. suide } 

te) Seti to the botiom clement of the stack. fenki ing the stack empty- =. } 

(f) Seti tothe bottom clement of the stack. leaving the stack unchanged. (Him: Use 
another, auxiliary stack.) 1K 

(g) Seti to the third element from the bottom ofthe stack. g 

Simulate the action of the algorithm in this section for each of the following strings by 

showing the contents of the stack at each point. 

(a) (A + B}) 

(b) 4{[A + Bl- KC- D)} 

(c) (A+B) -{C + D}-[F+C] . 

td (A) *{([J + KDD 

(e) (((A))) 


Write an algorithm to determine if an input character stil is of the form 


ui 


x ey 


where x is a string consisting of the- letters “A° and *B’, and where y is the reverse of 
x (thatis, ify = “ABABBA,” y must equal “ABBABA”). At each point you may read 
only the next character of the string. 


Write an algorithm to determine if an input character string is of the form 


Dh Dë D.D: 
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where each string a, b, . . ., z is of the form of the string defined in Exercise 2.1.3. (Thus 
a string is in the proper form if it consists of any number of such strings separated by 
the character ‘D".) At each point you may read only the next character of the string. 

2.1.5. Design an algorithm that does not use a stack to read a sequence of push and pop opera- 
tions, and determine whether or not underflow occurs dn some pop operation. Implement 
the algorithm as a C program. 

2.1.6. What set of conditions are necessary and sufficient for a sequence of push and pop op- 
erations on a single stack (initially empty) to leave the stack empty and not cause un- 
derflow? What set of conditions are necessary for such a sequence to leave a nonempty 
stack unchanged? 


2.2 REPRESENTING STACKS IN C 


Before programming a problem solution that uses a stack. we must decide how to rep- 
resent a stack using the data structures that exist in our programming language. As we 
shall see, there are several ways to represent a stack in C. We now consider the simplest 
of these. Throughout this text, you will be introduced to other possible representations. 
Each of them, however, is merely an implementation of the concept introduced in Sec- 
tion 2.1. Each has advantages and disadvantages in terms of how close it comes to 
mirroring the abstract concept of a stack and how much effort must be made by the 
programmer and the computer in using it. f 

A stack is an ordered collection of items, and C already contains a data type that is 
an ordered collection of items: the array. Whenevér a problem solution calls for the use 
of a stack, therefore, it is tempting to begin a program by declaring a variable stack as 
an array. However, a stack and an array are two entirely different things. The number of 
elements in an array is fixed and is assigned by the declaration for the array. In general, 
the user cannot change this number. A stack, on the other hand, is fundamentally a 
dynamic object whose size is constantly changing as items are popped and pushed. 

However, although an array cannot be a stack, it can be the home of a stack. That 
is, an aray can be declared large enough for the maximum size of the stack. During the 
course of program execution, the stack can grow and shrink within the space reserved 
for it. One end of the array is the fixed bottom of the stack, while the top of the stack 
constantly shifts as items are popped and pushed. Thus, another field is needed that, at 
each point during program execution, keeps track of the current position of the top of 
the stack. 

A stack in C may therefore be declared as a structure containing two objects: 
an array to hold the elements of the stack, and an integer to indicate the position of 
the current stack top within the array. This may be done for a stack of integers by the 
declarations 


#define STACKSIZE 100 
struct stack { 

int top; 

int items [STACKSIZE]; 
}; j 
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Once this has been done. an actual stack s may be declared by 


Struct stack s; 


Here, we assume that the elements of the stack s contained in the array s.items are 
integers and that the stack will at no time contain more than STACKSIZE integers. 
In this example STACKSIZE is set to 100 to indicate that the stack can contain 100 
elements (items{0] through items{99}). 

There is, of course, no reason to restrict a stack to contain only integers; items 
could just as easily have been declared as float items|STACKSIZE] or char 
items|STACKSIZE}, or whatever other type we might wish to give to the elements 
of the stack. In fact, should the need arise, a stack can contain objects of different types 
by using C unions. Thus 


#define STACKSIZE 100 
#define INT 1 
#define FLUAT 2 
#define STRING 3 
struct stackelement { 
int etype; /* etype equals INT, FLOAT, or STRING 2y 


/* depending on the type of the */ 
g“ corresponding element. */ 
union { 
int ival; 
float fval; 
char *pval; /* pointer tò a string */ 
} element; - 
}; 
struct stack { 
int top; 
struct. stackelement items [STACKSIZE]; 
} 


defines a stack whose items may be either integers, floating-point numbers, or strings, 
depending on the value of the corresponding etype. Given a stack s declared by 


struct stack s; 


we could print the top element of the stack as follows: 
struct stackelement se; 


se = s.items[s.top]; 

Switch (se.etype) { 
case INTGR : printf("% d\n", se.ival); 
case FLT > printf("% f\n", se.fval); 
Case STRING : printf("% s\n", se.pval); 

} /*end switch */ 
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For simplicity, in the remainder of this section we assume that a stack is declared 
to have only homogeneous elements (so that unions are not necessary). The identifier top 
must atways be declared as an integer, since its value represents the position within the 
array tems of the topmost stack element, Therefore. if the value of s.top is 4, there are 
tive elements on the stack: s. items|0). s.items{1}, s.items{2), s. items(3], and s.items|4}. 
When the stack is popped. the value of s.top is changed to 3 to indicate that there are _ 
now only four elements on the stack and that s. items{3] is the top element. On the other 
hand, if a new object is pushed onto the stack. the value of s.top must be increased by 
1 to 5 and the new object inserted into s.items[5]. 

The empty stack contains no elements and can therefore be indicated by top 
equalling —1. To initialize a stack s to the empty state, we may initially execute 
Stop = —1:. 

To determine, during the course of execution, whether or not a stack is empty the 
condition s.top = = —] may be tested in an if statement as follows: 


if (s.top == -1) 
/* stack is empty */ 

else : ; 

/* stack is not empty */ 


This test corresponds to the operation empry(s) that was introduced in Section 2.1. Al- 
ternatively, we may write a function that returns TRUE if the stack is empty and FALSE 
if it is not empty, as follows: 


int empty(struct stack “ps) 
t 
if (ps->top == -1) 
return(TRUE); 
else 
return(FALSE); 
} /* end empty */ 


Once this function exists, a test for the empty stack is implemented by the statement 


if (empty (&s)) 
/* stack is empty */ 
else 


/* stack is not empty */ 
Note the difference between the syntax of the call to empty in the algorithm of 
Section 2.1 and in the program segment here. In the algorithm, s represented a stack 
and the call to empty was expressed as 


empty(s) 


In this section, we are concerned with the actua! implementation of the stack and 
its’operations. Since parameters in C are passed by value, the only way to modify the 
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argument passed to a function is to pass the address of the argument rather than the 
argument itself. Further, the original definition of C (by Kernighan-Ritchie) and many 
older C compilers do not allow a structure to be passed as an argument even if its value 
remains unchanged. (Although this restriction has been omitted in ANSI C, it is gener- 
ally more efficient to pass a pointer when the structure is large.) Thus in functions such 
as pop and push (which modify their structure arguments), as well as empry (which does 
not), we adopt the convention that we pass the address of the stack structure, rather than 
the stack itself. ten etd. manent hard Aes} 

You may wonder why we bother to define the function empty when we could just 
as easily write if s.top = = — 1 each time that we want to test for the.empty condition. 
The answer is that we wish to make our programs more comprehensible and to make 
the use of a stack independent of its implementation. Once we understand the stack 
concept, the phrase “empry(&s)” is more meaningful than the phrase “s.top = =, ~1." 
If we should later introduce a better implementation of a stack, so that “s.top = =\—1” 
becomes meaningless, we would have to change every reference to the field identifier 
s.top throughout the entire program. On the other hand, the phrase “empry(&s)" would 
still retain its meaning, since it is an inherent attribute of the stack concept rather than 
of an implementation of that concept. All that would be required to revise a program 
to accommodate a new implementation of the stack would be a possible revision of the 
declaration of the structure stack in the main program and the rewriting of the function 
empry. (It is also possible that the form of the call to empty would have to be modified 
so that it does not use an address.) ` 

Aggregating the set of implementation-dependent trouble spots into small, easily 
identifiable units is an important method of making a program more understandable and 
modifiable. This concept is known as modularization, in which individual functions are 
isolated into low-level modules whose properties are easily verifiable. These low-level 
modules can then be used by more complex routines, which do not have to concern 
themselves with the details of the low-level modules but only with their function. The 
complex routines may themselves then be viewed as modules by still higher-level rou- 
tines that use them independently of their internal details. 

A programmer should always be concerned with the readability of the code he or 
she produces. A small amount of attention to clarity will save a large amount of time 
in debugging. Large- and medium-sized programs will almost never be correct the first 
time they are run. If precautions are taken at the time that a program is written to ensure 
that it is easily modifiable and comprehensible, the total time needed to get the program 
to run correctly is reduced sharply. For example, the if statement in the empry function 
could be replaced by the shorter, more efficient statement 


return (ps->top == -1); 
This statement is precisely equivalent to the longer statement 
if (ps->top == -1) 


return(TRUE); 
else return(FALSE); 


Sec. 2.2 Representing Stacks in C j 89 


90 


This is because the value of the expression ps— >top == —1 is TRUE if and only if the 
condition ps- >top == —1 is TRUE. However, someone who reads a program will 
probably be much more comfortable reading the if statement. Often you will find that 
if you use “tricks” of the language in writing programs, you will be unable to decipher 
your Own programs after putting them aside for a day or two. 

Although it is true that the C programmer is often concerned with economy of 
code, it is also important to consider the time that will no doubt be spent in debugging. 
The mature professional (whether in C or other language) is constantly concerned with 
the proper balance between code economy and code clarity. 


Implementing the pop Operation 


The possibility of underflow must be considered in implementing the pop opera- 
tion, since the user may inadvertently attempt to pop an element from an empty stack. 
Of course, such an attempt is iliegal and should be avoided. However. if such an attempt 
should be made the user should be informed of the underflow condition. We therefore 
introduce a function pop that performs the following three actions: 


1. If the stack is empty. print a warning message and halt execution. 
2. Remove the top element from the stack. ~- 7 
3. Return this element to the calling program. 


We assume that the stack consists of integers, so that the pop operation can be imple- 
mented as a function. This would also be the case if the stack consisted of some other 
type of simple variable. However, if a stack consists of a more complex structure (for 
example. a structure or a union), the pop operation would either be implemented as 
returning a pointer to a data element of the proper type (rather than the data element 
itself), or the operation would be implemented with the popped value as a parameter 
Gn which case the address of the parameter would be passed rather than the parameter, 
so that the pop function could modify the actual argument). 


int pop(struct stack *ps) 


if (empty(ps)) { 
printf("%", "stack underflow"); 
exit(1); 
} /* end if */ 
return(ps->items[ps->top--]); 

} /* end pop */ 

Note that ps is already a pointer to a structure of type stack; therefore. the address 
operator “&™ is not used in calling empty. In all applications in C, one must always 
distinguish between pointers and actual data objects. 

Let us look at the pop function more closely. If the stack is not empty, the top 
element of the stack is retained as the returned value. This element is then removed 
from the stack by the expression ps —> top—~. Assume that when pop is called, 
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ps -> top equals 87; that is, there are 88 items on the stack. The value of ps -> items{87] 
is returned, and the value of ps -> top is changed to 86. Note that ps —> items{87] 
still retains its old value; the array ps -> items remains unchangéd by the call to pop. 
However, the stack is modified, since it now contains only 87 elements rather than 88. 
Recall that an array and a stack are two different objects. The array only provides a 
home for the stack. The stack itself contains only those elements between the zeroth 
element of the array and the rapih element. Thus reducing the value of ps -> top by 1 
effectively removes an element from the stack. This is true despite the fact that ps —> 
items[87] retains its old value. 


To use the pop function, the programmer can declare int x and write 


x = pop (&s); 


x then contains tne value popped from the stack. If the intent of the pop operation is not 
to retrieve the element on the top of the stack but only to remove it from the stack, the 
value of x will not be used again in the program. 

Of course, the programmer should ensure that the stack is not empty when the 


pop operation is called. If the programmer is unsure of the state of the stack, its status 
may be determined by coding 


if Clempty(&s)) 


x = pop (ês); 
else 


/* take remedial action */ 


If the programmer unwittingly does call pop with an empty stack, the function 
prints the error message stack underflow and execution halts. Although this is an un- 
fortunate state of affairs, it is far better than what would occur had the if statement in 
the pop routine been omitted entirely. In that case, the value of s.top would be — i and 
an attempt would be made to access the nonexistent element s. items[—1]. 

A programmer should always provide for the almost certain possibility of error. 
This can be done by including diagnostics that are meaningful in the context of the prob- 
lem. By doing so, if and when an error does occur, the programmer is able to pinpoint 
its source and take corrective action immediately. 


Testing for Exceptional Conditions 


Within the context of a given problem, it may not be necessary to halt execu- 
tion immediately upon the detection of underflow. Instead, ıt might be more desirable 
for the pop routine to signal the calling: program that an underflow has occurred. The 
calling routine, upon detecting this signal, can take corrective action. Let us call the pro- 


cedure that pops the stack and returns an indication whether underflow has occurred, 
popandtest: 
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void popandtest(struct stack *ps, int *px, int *pund) 
{ 


if (empty(ps)) { 
*pund = TRUE; 
return; 
} /* end if */ 
*pund = FALSE; 
*px = ps->items[ps->top--]; 
return; 
} /* end popandtest */ 


In the calling program the programmer would write 


popandtest(&s, &x, &und); 
if (und) 

/* take corrective action */ 
else 

/* use value of x */ 


Implementing the Push Operation 


Let us now examine the push operation. It seems that this operation should be 
quite easy to implement using the array representation of a stack. A first attempt at a 
push procedure might be the following: 


void push(struct stack “ps, int x) 
{ 
ps->items[++(ps-. top)] = x; 
return; 
} /* end push */ 


This routine makes room for the item x to be pushed onto the stack by incrementing 
slop by 1, and then inserts x into the array s. items. 

The routine directly implements the push operation introduced in Section 2.1. 
Yet, as it stands. it is quite incorrect. It allows a subtle error to creep in, caused by using 
the array representation of the stack. Recall that a stack is a dynamic structure that is 
constantly allowed to grow and shrink and thus change its size. An array, on the other 
hand, is a fixed object of predeterinined size. Thus, it is quite conceivable that a stack 
may outgrow the array that was set aside to contain it. This occurs when the array is 
full, that is, when the stack contains as many elements as the array and an attempt is 
made to push yet another element onto the stack. The result of such an attempt is called 
an overflow. 

Assume that the array s.items is full and that the C push routine is called. Re- 
member that the first array position is 0 and the arbitrary: size (STACKSIZE) chosen 
for the array s.iremy is 100. The full array is then indicated by the condition Sop == 
99, so that position 99 (the 100th element of the array) is the current top of the stack. 
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When push is called, s.fop is increased to 100 and an attempt is made to insert x into 
s.items| 100]. Of course, the upper bound of s. items is 99, so that this attempt at insertion 
results in an unpredictable error, depending on the contents of.the memory location 
following the last array position. An error message may be produced that is unlikely to 
relate to the cause of the error. 


The push procedure must therefore be revised so that it reads as follows: 


void push(struct stack *ps, int x) 
{ \ 
if (ps->top == STACKSIZE-1) { 
“printf("%s", “stack overflow"); 
exit(1); 


else 
9s->items[++(ps->top)] = x; 
return; 
} /* end push */ 


Here, we check whether the array is full before attempting to push another element onto 
the stack. The array is full if ps -> top == stacksize — 1. 

You should again note that if and when overflow is detected in push, execution 
halts immediately after an error message is printed. This action, as in the case of pop, 
may not be the most desirable, It might, in some cases, make more sense for the ca!ling 
routine to invoke the push operation with the instructions 


pushandtest(&s, x, & overflow); 

if (overflow) 
/* overflow has been detected, x was not */ 
/* pushed on stack. take remedial action. */ 


* x was successfully pushed on the stack */ 
/ continue processing. *] 


This allows the calling program to proceed after the call to pushandtest, whether or not 

overflow was detected. The subroutine pushandtest is left as an exercise for the reader. 
5 Although the overflow and underflow conditions are treated similarly in push and 
pop, there is a fundamental difference between them. Underflow indicates that the pop 
operation cannot be performed on the stack and may indicate an error in the algorithm or 
the data. No other implementation or representation of the stack will cure the underflow 
condition. Rather, the entire problem must be rethought. (Of course, an underflow might 
occur as a signal for ending one process and beginning another. But in such a case 
popandtest rather than pop should be used.) á; 

Overflow, however, is not a condition that is MEE SA to a stack as an abstract 
data structure. Abstractly, it is always possible to push an element onto a stack. A stack 
is just an ordered set, and there is no limit to the number of elements that such a set can 
contain. The possibility of overflow is introduced when a stack is implemented by an 
array with only a finite number of elements, thereby prohibiting the Srowth of the stack - 
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beyond that number. It may very well be that the algorithm that the programmer used 
is Correct, just that the implementation of the algorithm did not anticipate that the stack 
would become so large. Thus, in some cases an overflow condition can be corrected by 
changing the value of the constant STACKSIZE so that the array field items contains 
more eiements. There is no need to change the routines Pop or push, since they refer 
to whatever data structure was declared for the type stack in the program declarations. 
Push also refers to the constant STAC. KSIZE, rather than to the actual value 100. 

However, more often than not, an overflow does indicate an error in the program 
that cannot be attributed to a simple lack of space. The program may be in an infinite 
loop in which items are constantly being pushed onto the stack and nothing is ever 
popped. Thus the stack will outgrow the array bound no matter how high that bound is 
set. The programmer should always check that this is not the case before indiscrimi- 
nately raising the array bound. Often the maximum stack size can be determined easily 
from the program and its inputs, so that if the stack does overflow there is probably 
something wrong with the algorithm that the program represents. 

Let us now look at our last Operation on stacks, staektop(s), which returns the 
top element of a stack without removing it from the stack. As noted in the last section, 


stacktop is not really a primitive operation because it can be decomposed into the two 
operations: 


x = pop(s); 
push (s,x); 


However, this is a rather awkward way to retrieve the top element of a stack. Why not 
ignore the decomposition noted above and directly retrieve the proper value? Of course, 
a check for the empty stack and underflow must then be explicitly stated, since the test 
is no longer handled by a call to pop. 

We present a C function stacktop for a stack of integers as follows: 


int stacktop(struct stack *ps) 


if (empty(ps)) { 
printf("%s", "stack underflow"); 
exit(1); 


else 
return(ps->items[ps->top]); 

} /*end stacktop */ 

= K] 

You may wonder why we bother writing a separate routine stacktop when a ref- 
erence to s.items[s.top] would serve just as well. There are several reasons for this. 
First, the routine Stacktop incorporates a test for underflow, so that no mysterious error 
occurs if the stack is empty. Second, it allows the programmer to use a stack without 
worrying about its internal makeup. Third. if a different implementation of a stack is 
introduced. the programmer need not comb through all the places in the program that re- 
fer to s.irems{s.top] to make those references compatible with the new implementation. 
Only the stacktop routine would need to be changed. 
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EXERCISES 


2.2.1. 


2.2.2. 


2.2.3. 
2.2.4. 


2.2.5. 


2.2.6. 


2.2.7. 


2.2.8. 


Write C functions that use the routines presented in this chapter tœimplement the oper- 
ations of Exercise 2.1.1. 


Given a sequence of push and pop operations and an integer representing the size of an 
array in which a stack is to be implemented, design an algorithm to determine whether 
or not overtiow occurs. The algorithm should not use a stack. Implement the algorithm 
as a C program. i 
Implement the algorithms of Exercises 2.1.3 and 2.1.4 as C programs. 

Show how to implement a stack of integers in C by using an array int s|STACKSIZE), 
where s[0} is used to contain the index of the top clement of the stack, and where s| 1] 
through s|STACKSIZE— J] contain the clements on the stack. Write a declaration and 
routines pop, push, empty, popandtest, stacktop, and pushandtest for this implementa- 
tion. 

Implement a stack in C in which each item on the stack is a varying number of integers. 
Choose a C data structure for such a stack and design push and pop routines for it. 


Consider a language that does not have arrays but does have stacks as a data type. That 
is, one can declare i 


stack s; 


and the push, pop. popandtest, and stacktop operations arè defined. Show how a one- 
dimensional array can be implemented by using these operations on two stacks. 
Design a method for keeping two stacks within a single linear array Sfspacesize] so that 
neither stack overflows until all of memory is used and an entire stack is never shifted 
to a different location within the array. Write C routines pushl. push2. popl and pop2 
to manipulate the two stacks. (Hint: The two stacks grow toward each other.) 

The Bashemin Parking Garage contains a single lane that holds up tc ten cars. There is 
only a single entrance/exit to the garage at one end of the tane. If a customer arrives to 
pick up a car that is not nearest the exit, all cars blocking its path are moved out, the 
customer's car is driven out, and the other cars are restored in the same order that they 
were in originally. Write a program that processes a group of input lines. Each input line 
contains an `A’ for arrival or a `D’ for departure. and a license plate number. Cars are 
assumed to arrive and depart in the order specified by the input. The program should print 
a message whenever a car arrives or departs. When a car arrives. the message should 
specify whether or not there is room for the car in the garage. If there is no room, the car 
leaves without entering the garage. When a car departs, the message-should include the 
number of times that the car was moved out of the garage to allow other cars to depart. 


2.3 EXAMPLE: INFIX, POSTFIX, AND PREFIX 
-Basic Definitions and Examples , 


This section examines a major application that illustrates the different types of 


stacks and the various operations and functions defined upon them. The example is 
also an important topic of computer science in its own right. 
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Consider the sum of A and B. We think of applying the operator “+” to-the 
operands A and B and write the sum as A + B. This particular representation is called 
infix. There are two alternate notations for expressing the sum of A and B using the 
symbols A, B, and +. These are 


+AB prefix 
AB+ postfix 


The prefixes “pre-,” “post-,” and “in-” refer to the relative position of the oper- 
ator with respect to the two operands. In prefix notation the operator precedes the two. 
operands, in postfix notation the operator follows the two operands, and in infix notation 
the operator is between the two operands. The prefix and postfix notations are not really 
as awkward to use as they might at first appear. For example, a C function to return the 
sum of the two arguments A and B is invoked by add(A, B). The operator add precedes 
the operands A and B. 

Let us now consider some additional examples. The evaluation of the expression 
A + B x C, as written in standard infix notation, requires knowledge of which of the 
two operations, + or *, is to be performed first. In the case of + and * we “know” 
that multiplication is to be done before addition (in the absence of parentheses to the 
contrary). Thus A + B * Cis interpreted as A + (Bx C ) unless otherwise specified. 
We say that multiplication takes precedence over addition. Suppose that we want to 
rewrite A + B * C in postfix. Applying the rules of precedence, we first convert the 
portie of the expression that is evaluated first, namely the multiplication. By doing 
this conversion in stages we obtain 


A+(B*C) parentheses for emphasis 
A + (BC *) convert the multiplication 
A (BC *) + convert the addition 


ABC * +. postfix form 


The only rules to remember during the conversion process are that operations with 
highest precedence are converted first and that after a portion of the expression has been 
converted to postfix it is to be treated as a single operand. Consider the same example 
with the precedence of operators reversed by the deliberate insertion of parentheses. 


(A +B)*«C infix form 

(AB +)*C convert the addition 

(AB +) C x convert the multiplication 
AB +C- postfix form 


In this example the addition is converted before the multiplication because of the paren- 
theses. In going from (A + B) * C to (AB+) * C, A and B are the operands and + is the 
operator. In going from (AB +) * Cto (AB +)C *, (AB +) and C are the operands and * 
is the operator. The rules for converting from infix to postfix are simple, providing that 
you know the order of precedence. 

We consider five binary operations; addition, subtraction, multiplication, division, 
and exponentiation. The first four are available in C and are denoted by the usual op- 
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erators +, —, *, and /. The fifth, exponentiation, is represented by the operator $. The 
value of the expression A $ B is A raised to the B power, so that 3 $ 2 is 9. For these 
binary operators the following is the order of precedence (highest to lowest): - 


Exponentiation 
Multiplication/division 
Addition/subtraction 


i When unparenthesized operators of the same precedence are scanned, the order 
is assumed to be left to right except in the case of exponentiation, where, the order is 
assumed to be from right to left. Thus A + B + C means (A + B) + C, whereasA $B 
$ C means A $ (B $ C ). By us‘ng parentheses we can override the default precedence. 
We give the following additional examples of converting from infix to postfix. Be 
sure that you understand each of these examples (and can do them on your own) before 
proceeding to the remainder of this section. 


Infix Postfix 
A+B AB + 
A+B-C AB+C- 
(A + B)*(C - D) AB + CD — * s 


A$B*C—D+E/F/(G+H) AB $C * D — EF/GH +/+ 
(A + B)*C-— (D +E) $ (F + G) AB + C*DE-- FG+$ 
A — BAC*D$E) ABCDE $ */ — 
The precedence rules for converting an expression from infix to prefix are iden- 
tical. The only change from postfix conversion is that the operator is placed before the 


operands rather than after them. We present the prefix forms of the foregoing expres- 
sions. Again, you should attempt to make the transformations on your own, 


Infix Prefix 
A+B + AB 
A+B-C - + ABC 
(A + B) *(C — D) * + AB — CD 
A$B*C-D+E/F/(G +H) +- +$ ABCD//EF + GH 
((A + B)*C — (D — E) $ (F + G) $ — » + ABC —- DE + FG 
A — B/(C * D $ E) -A/B*C$DE 


Note that the prefix form of a complex expression is not the mirror image of the postfix 
form, as can be seen from the second of the foregoing examples, A + B — C. We will 
henceforth consider only postfix transformations and leave to the reader as exercises 
most of the work involving prefix. 

One point immediately obvious about the postfix form of an expression is that it 
requires no parentheses. Consider the two expressions A + (B * C) and (A + B) * C. 
Although the parentheses in one of the two expressions is superfluous [by convention 
A+B*C =A + (B * C)]), the parentheses in the second expression are necessary to 
avoid confusion with the first. The postfix forms of these expressions are 


Sec. 2.3 Example: Infix, Postfix, and Prefix 97 


Infix Postfix 
A + (B *C) ABC * + 
(A+B)*C |. AB+Cx 


There are no parentheses in either of the two transformed expressions. The order 
of the operators in the postfix expressions determines the actual order of operations in 
evaluating the expression, making the use of parentheses unnecessary. 

In going from infix to postfix we sacrifice the ability to note. at a glance the 
operands associated with a particular operator. We gain, however, an unambiguous form 
of the original expression without the use of cumbersome parentheses. In fact, the post- 
fix form of the original expression might look simpler were it not for the fact that it 
appears difficult to evaluate. For example, how do we know that if A = 3, B = 4, and 
C = 5 in the foregoing examples, then 3 4 5 * + equals 23 and 3 4 + 5 * equals 35? 


Evaluating a Postfix Expression 


The answer to the foregoing question lies in the development of an algorithm for 
evaluatihg expressions in postfix. Each operator in a postfix string refers to the previous 
two operands in the string. (Of course, one of these two operands may itself be the result 
of applying a previous operator.) Suppose that each time we read an operand we push 
it onto a stack. When we reach an operator, its operands will be the top two elements 
on the stack. We can then pop these two elements, perform the indicated operation on 
them, and push the result on the stack so that it will be available for use as an operand 


of the next operator. The following algorithm evaluates an expression in postfix using 
this method: 


opndstk = the empty stack; 
/* scan the input string reading one */ 
/* element at a time into symb */ 
while (not end of input) { 
symb = next input character; 
if (symb is an operand) 
push(opndstk, symb); 
else { 
/* symb is an operator */ 
opnd2 = pop(opndstk) ; 
opnd1 = pop(opndstk); 
value = result of applying symb to opndl and opnd2; 
push(opndstk, value); 
+» } /* end else */ : 
} /* end while */ 
return(pop( opndstk)); 


` Let us now consider an example. Suppose that we are asked to evaluate the fol- 
lowing postfix expression: 


GR 2 Shek tes R Di Ate 2 Seid. 


- We show the contents of the stack opndstk and the variables symb, opnd1, opnd2, and 
value after each successive iteration of the loop. The top of opndstk is to the right. 
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Set fe Le eS LL 
symb opnd\ opnd2 value opndstk 


eee 


6 6 

2 : 6,2 

3 6,2,3 
+ 2 3 5 6.5 
- 6 5 1 1 

3 6 5 1 1,3 

8 6 5 1 1,3,8 
2 6 5 1 1,3,8,2 
Í 8 2 4 1,3,4 
+ 3 4 7 1,7 

* 1 Ts 7 7 

2 1 7 7 7,2 
$ 7 2 49 49 

3 7 2 49 49,3 
+ 49 3 52 52 


Each operand is pushed onto the operand stack as it is encountered. Therefore the 
maximum size of the stack is the number of operands that appear in the input expression. 
However, in dealing with most postfix expressions the actual size of the stack needed is 
less than this theoretical maximum, since an operator removes operands from the stack. 
In the previous example the stack neve” contained more than four elements, despite the 
fact that eight operands appeared in the postfix expression. 


Program to Evaluate a Postfix Expression 


There are a number of questions we must consider before we can actually write a. 
program to evaluate an expression in postfix notation. A primary consideration, as in all 
programs, is to define precisely the form and restrictions, if any, on the input. Usually 
the programmer is presented with the form of the input and is required to design a 
program to accommodate the given data. On the other hand, we are in the fortunate 
position of being able to choose the form of our input. This enables us to construct 
a program that is not overburdened with transformation problems that overshadow the 
actual intent of the routine. Had we been confronted with data in a form that is awkward 
and cumbersome to work with, we could have relegated the transformations to various 
functions and used the output of these functions as input to our primary routine. In the 
“real world,” recognition and transformation of input is a major concern. 

Let us assume in this case that each input line is in the form of a string of digits and 
operator symbols. We assume that operands are single nonnegative digits, for example, 
0, 1, 2, ..., 8, 9. For example, an input line might contain 3 4 5 * + in the first 5 
columns followed by an end-of-line character (‘\ n°). We would like to write a program 
that reads input lines of this format. as long as there are any remaining, and prints for 
each line the original input string and the result of the evaluated expression. 

Since the symbols are read as characters, we must find a method to convert the 
operand characters to numbers and the operator characters to operations. For example, 


we must have a method for converting the character ‘5° to the number 5 and the char- 
acter ‘+` to the addition operation. 
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The conversion of a character to an integer can be handled easily in C, If int x 
is a single digit character in C, the expression x — ‘O° yields the numerical value of 
that digit. To implement the operation Corresponding to an operator symbol, we use a 
function oper that accepts the character representation of an operator and two operands 
as input parameters, and returns the value of the expression obtained by applying the 
operator to the two operands. The body of the function will be presented shortly. 

The body of the main program might be the following. The constant MAXCOLS 
is the maximum number of columns in an input line. 


#include <stdio.h> 
#include <stdlib.h> 
#include <math.h> 


#define MAXCOLS 80 
#define TRUE 1 
#define FALSE 0 


double eval(char[]); 

double pop(struct stack *); 

void push(struct stack *, double); 
int empty(struct stack *); 

int isdigit(char); 

double oper(int, double, double); 


void main() 


char expr[MAXCOLS]; 
int position = 0; 


while((expr(position++]. = getchar()) != '\n') 
expr[--position} = '\0';* 7 
printf("%s%s", "the original postfix expression is", expr); 


printf("\nxf", eval(expr)); 
} /* end main */ 


The main part of the program is, of course, the function eval, which follows. That 
function is merely the C implementation of the evaluation algorithm, taking into account 
the specific environment and format of the input data and calculated outputs. eval calls 
on a function isdigit that determines whether or not its argument is an operand. The. 
declaration for a stack that appears below is used by the function eval that follows it as 
well as by the routines pop and push that are called by eval. 


struct stack { 

int top; i 

double items [MAXCOLS]; 
3; 
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double eval(char expr[]) 

{ 
int c, position; 
double opndl, opnd2, value; 
struct stack opndstk; 


opndstk.top = -1; 
for (position = 0; (c = expr 

if (isdigit(c)) 
/* operand-- convert t 


[positi 


Py 


on}) != '\0'; position+ +) 


he character representation */ 


ř /* of the digit into double and push it onto */ 
s the stack - f */ 
push(&opndstk, (doùble) (c-'0')); 

else { 
ys operator */ 


opnd2 = pop(&opndstk) ; 
opndl = pop(&opndstk) ; 
value = oper(c, opndi, opnd2) 
push(&opndstk, value); 
} /* end else */ 
return(pop(&opndstk)) ; 
} /* end eval */ 


For completeness we present isdigit and oper. The function isdigit simply checks 


if its argument is a digit: 
int isdigit(char symb) 


return(symb >= 'O' && symb <= '9'); 
} 


This function is available as a predefined m 


The function oper checks that its first argument is a valid operator and, 


determines the results of its operation on th 


acro in most C systems. 
if it is, 
e next two arguments. For exponentiation, 


we use the function pow(op], op2) as defined in math-h. 


double oper(int symb, double opl, double op2) 


switch(symb) { 


case '+' : return (opl + op2); 
case '-' : return (opl - op2); 
case '*' : return (opl * op2); 
case '/' : return (opl / op2); 
case '$' : return (pow(opl, op2)); 


default : printf("%s", "illegal 
S exit(1); 
} /* end switch */ 


} /* end oper */ 
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operation"); 
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Limitations of the Program 


Before we leave the program, we should note some of its deficiencies. Under- 
standing what a program cannot do is as important as knowing what it can do. It should 


A major criticism of this Program is that it does nothing in terms of error detec- 
tion and recovery. If the data on each input line represents a valid postfix expression, 
the program works, Suppose, however, that one input line has too Many operators or 
operands or that they are not in a Proper sequence. These problems could come about 
as a result of someone innocently using the program on a postfix expression that con- 


excess number of operators. Depending on the specific type of error, the computer may 
take one of several actions (for example, halt execution or print erroneous results). 
Suppose that at the final statement of the program, the stack opndstk is not empty. 
We get no error messages (because we asked for none), and eval returns a numerical 
value for an expression that was probably incorrectly stated in the first place. Suppose 
that one of the calls to the pop routine raises the wide ow condition. Since we did 


seems unreasonable, since faulty data on one line should not prevent the processing of 
additional lines. By no means are these the only problems that could arise, As exercises, 
you may wish to write programs that accommodate less restrictive inputs and some 


è 
» 


` 
Converting an Expression from Infix to Postfix 


We have thus far presented routines to evaluate a postfixexpression. Although we 
have discussed a method for transforming infix to postfix, we have not as yet presented 
an algorithm for doing so. It is to this task that we now direct our attention. Once such 
an algorithm has been constructed, we will have the capability of reading an infix ex- 
pression and evaluating it by first converting it to postfix and then evaluating the postfix 
expression, 

In our previous discussion we mentioned that expressions within innermost paren- 
theses must first be converted to postfix so that they can then be treated as single 
operands. In this fashion parentheses can be successively eliminated until the entire 
expression is converted. The last pair of parentheses to be opened within a group of 
parentheses encloses the first expression within that group to be transformed. This last- 
in, first-out behavior should immediately suggest the use of a stack. 
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Consider the two infix expressions A + B * Cand (A + B) * C, and their respective 
postfix versions ABC * + and AB + C x. In each case the order of the operands is the 
same as the order of the operands in the original infix expressions. In scanning 
the first expression, A + B * C., the first operand, A, can be inserted immediately 
into the postfix expression. Clearly the + symbol cannot be inserted until after its 
second operand, which has not yet been scanned, is inserted. Therefore, it must be 
stored away to be retrieved and inserted in its proper position. When the operand B is 
scanned, it is inserted immediately after A. Now, however, two operands have been 
scanned. What prevents the symbol + from being retrieved and inserted? The answer 
is, of course, the * symbol that follows, which has precedence over +. In the case of 
the second expression the closing parenthesis indicates that the + operation should be 
performed first. Remember that in postfix, unlike infix, the operator that appears earlier 
in the string is the one that is applied first. i 

Since precedence plays such an important role in transforming infix to. postfix, let 
us assume the existence of a function pred(op1,op2), where op1 and op2 are characters 
representing operators. This function returns TRUE if op] has precedence over op2 
when op! appears to the left of op2 in an infix expression without parentheses. pred 
(op1,op2) retums FALSE otherwise. For example, pred(‘*’,‘+’) and pred(*+’,” +") are 
TRUE, whereas pred(‘+’,**’) is FALSE. . 3 

Let us now present an outline of an algorithm to convert an infix string without 
parentheses into a postfix string. Since we assume no parentheses in the input string, the 
only governor of the order in which operators appear in the postfix string is precedence, 
(The line numbers that appear in the algorithm will be used for future reference.) 


1 opstk = the empty stack; 
2 while (not end of input) { 
3 symb = next input character; 
4 if (symb is an operand) 
_add symb to the postfix string 
5 else { 
6 while({empty(opstk) && prcd(stacktop(opstk), symb)) { 
7 topsymb = pop(opstk); 4 
8 add topsymb to the postfix string; 
} /* end while */ 
9 push(opstk, symb); 
} /* end else */ 
} /* end while */ 
/* output any remaining operators */ 
10 while (!empty(opstk)) { 
11 topsymb = pop(opstk); 
12 add topsymb to the postfix string; 
} /* end while */ 


Simulate the algorithm with such infix strings as “A * B + C * D” and “A + B 
«C$D$E” [where ‘$’ represents exponentiation and pred (‘$", ‘$°) equals FALSE | to 


é 
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convince yourself that it is correct. Note that at each point of the simulation, an operator 
on the stack has a lower Precedence than all the operators above it. This is because the 
intial empty stack trivially satisfies this condition, and an operator is pushed onto the 
stack (line 9) only if the Operator currently on top of the stack has a lower precedence 
than the incoming operator. : 

What modification must be made to this algorithm to accommodate parenthe- 
ses? The answer is, surprisingly little. When an opening parenthesis is read, it must be 
pushed onto the stack. This can be done by establishing the convention that pred(op,‘(’) 
equals FALSE, for any operator symbol op other than a right parenthesis. In addition, 
we define pred(‘( *,0p) to be FALSE for any operator symbol op. [The case of op==')’ 
will be discussed shortly.] This ensures that an operator symbol appearing after a left 
parenthesis is pushed onto the stack: 

When a closing parenthesis is read, all operators up to the first opening parenthesis 
must be popped from the stack into the postfix string. This can be done by defining 
pred(op, ‘)’) as TRUE for all Operators op other than a left parenthesis, When these 


parenthesis, the loop beginning af line 6 is skipped, so that the opening parenthesis is 
not inserted into the postfix string. Execution therefore Proceeds to line 9. However, 
Since the closing parenthesis should not be pushed onto the stack, line 9 is replaced by 
the statement 


9 if (empty(opstk) || symb != ')') 
push(opstk, symb); 
else /* pop the open parenthesis and discard it */ 
topsymb = pop(opstk); : 


‘With the foregoing conventions for the pred function and the revision to line 9; 
ihe algorithm can be used to convert any infix string to postfix. We summarize the 
precedence rules for parentheses: 


prcd('(',op) = FALSE for any operator op 

prcd(op,'(') = FALSE for any operator op other than ')' 

prcd(op,')') = TRUE for any operator op other than '(' 

prcd(")',op) = undefined for any operator op (an attempt 
to compare the two indicates an 
error). 


We illustrate this algorithm on some examples:. 
Example 1: A+ Bx Ç 


The contents of symb, the postfix string, and opstk are shown after scanning each 
symbol. opstk is shown with its top to the right. 
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symb postfix string opstk 


l A A 

2 + A + 
3 B ..: AB + 
4 « % AB +* 
5 C ABC +* 
6 ABC * + 
7 ABC * + 


Lines 1, 3, and 5 correspond to the scanning of an operand; therefore the symbol (symb) 
is immediately placed on the postfix string. In line 2 an operator is scanned and the 
„stack is found to be empty, and the operator is therefore placed on the stack. In line 4 
the precedence of the new symbol (*) is greater than the precedence of the symbol on 
the top of the stack (+); therefore the new symbol is pushed onto the stack. In steps 6 


and 7 the input string is empty, and the stack is therefore popped and its contents are 
placed on the postfix string. 


Example 2: (A +B)*C 


symb postfix string . opstk 


( 
A 5 ( 
A ( 
AB ( 
AB+ 

AB+ * 
AB+C 
AB+C* 


A*—~ B+ BA 


In this example, when the right parenthesis is encountered the stack is popped 
until a left parenthesis is encountered, at which point both parentheses are discarded. 
By using parentheses to force an ordet of precedence different than the default, the 
order of appearance of the operators in the postfix string is different than in example 1. 


Example 3: ((A — (B + C)) * D) $ (E + F) (See example 3 on top of page 106.) 


Why does the conversion algorithm seem so involved, whereas the evaluation al- 
gorithm seems so simple? The answer is that the former converts from one order of 
precedence (governed by the prcd function and the presence of parentheses) to the nat- 
ural order (that is, the operation to be executed first appears first). Because of the many 
combinations. of elements at the top of the stack (if not empty) and possible incoming 
symbol, a large number of statements are necessary to ensure that every possibility is 
covered. In the latter algorithm, on the other hand, the operators appear in precisely 
the order they are to be executed. For this reason the operands can be stacked until an 
operator is found, at which point the operation is performed immediately. 
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EXAMPLE 3 


symb postfix string opstk 
( ( 
( (( 
A A (( 
= A t= 
( A Ge=£ 
B AB (C= ( 
+ AB (¢- (+ 
C ABC ((=(+ 
) ABC + ((- 
) ABC + - ( 
* ABC + - (* 
D ABC + -D (* 
) ABC + -D * 
$ ABC + —-D* $ 
( ABC + —-D* $ ( 
E ABC + -D*E $( 
+ ABC + -D*E $(+ 
F ABC + -D* EF $( + 
) ABC + —D* EF + $ 


ABC + -D*EF +$ 


The motivation behind the conversion algorithm is the desire to output the oper- 
ators in the order in which they are to be executed. In solving this problem by hand 
we could follow vague instructions that require us to convert from the inside out. This 
works very well for humans doing a problem with pencil and paper (if they do not be- 
come confused or make a mistake). However, a program or an algorithm must be more 
precise in its instructions. We cannot be sure that we have reached the innermost paren- 
theses or the operator with the highest precedence until additional symbols have been 
scanned. At the time, we must backtrack to some previous point. 

Rather than backtrack continuously, we make use of the stack to “remember” the 
operators encountered previously. If an incoming operator is of greater precedence than 
the one on top of the stack, this new operator is pushed onto the stack. This means that 
when all the elements in the stack are finally popped, this new operator will precede 
the former top in the postfix string (which is correct since it has higher precedence). 
If, on the other hand, the precedence of the new operator is less than that of the top of 
the stack, the operator at the top of the stack should be executed first. Therefore the top 
of the stack is popped and the incoming symbol is compared with the .new top, and so 
on. Parentheses in the input string override the order of operations. Thus when a left 
parenthesis i is scanned, it is pushed on the stack. When its associated right capi 
is found, all the operators between the two parentheses are placed on the output string 
because they are to be executed before any operators appearing dfter the parentheses. 


Program to Convert an Expression from infix to Postfix 


There are two things that we must do before we actually start writing a pro- 
gram. The first is to define precisely the format of the input and output. The second is 
to construct, or at least define, those routines that the main routine denends upon. We 
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assume that the input consists of strings of characters, one string per input line. The 
end of each string is signaled by the occurrence of an end-of-line character (*\ n’). 
For the sake of simplicity, we assume that all operands are single-character letters or 
digits. All operators and parentheses are represented by themselves, and ‘$ ` represents 
exponentiation. The output is a character string. These conventions make the output of 
the conversion process suitable for the evaluation process, provided that all the single 
character operands in the initial infix string are digits. 

In transforming the conversion algorithm into a program, we make use of several 
routines. Among these are empty, pop, push and popandtest, all suitably modified so 
that the elements on the stack are characters. We also make use of a function isoperand 
that returns TRUE if its argument is an operand and FALSE otherwise. This simple 
function is left to the reader. 

` Similarly, the pred function is left to the reader as an exercise. It accepts two 
single-character operator symbols as arguments and returns TRUE if the first has prece- 
dence over the second when it appears to the left of the second in an infix string and 
FALSE otherwise. The function should, of course, incorporate the parentheses conven- 
tions previously introduced. = 

Once these auxiliary functions have been written, we can write the conversion 
function postfix and a program that calls it. The program reads a line containing an 


expression, in infix, calls the routine postfix, and prints the postfix string. The body of 
the main routine follows: 


#include <stdio.h> 
#include <stdlib.h> 


#define MAXCOLS 80 
#define TRUE 1 
#define FALSE 0 


void postfix(char *, char *); 

int isoperand(char) ; 

void popandtest(struct stack *, char *, int *); 

int pred(char, char); ‘ 
void push(struct stack *, char); 

char pop(struct stack *); 


void main() 

{ 
char infix[MAXCOLS]; 
char postr[MAXCOLS]; 
int pos = 0; 


while ((infix[pos++] = getchar()) != '\n'); 
infix[--pos] = '\0'; 
printf("%s%s", "the original infix expression is ", infix); 
postfix(infix, postr); 
printf("%s\n", postr); 
} /* end main */ 
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The declaration for the operator stack and the postfix routine follows: 
$ 


struct stack { 

int top; 

char items [MAXCOLS); 
}; 


postfix(char infix[], char postr[}) | 
{ , / 
int position, und; 
int outpos = 0; 
char topsymb = '+'; 
char symb; 
struct stack opstk; 
opstk.top = -1; /* the empty stack */ 


for (position=0; (symb = infix[position]) != '\0'; position++) 
if (isoperand(symb)) 
postr[outpos++] = symb; 
else { 
popandtest(&opstk, &topsymb, Gund); 
while (‘und & prcd(topsymb, symb)) { 
postr[outpos++] = topsymb; 
popandtest(&opstk, &topsymb, &und); 
} /* end while */ 
if (lund) 
push(&opstk, topsymb) ; 
if (und || (symb != ')')) 
push(&opstk, symb); 
else 
topsymb = pop(&opstk) ; 
} /* end else */ 
while (!empty(&opstk)) 
postr[outpos++] = pop(&opstk); 
postr[outpos] = '\0'; 
return; 


} /* end postfix */ 
ra 

The program has one major flaw in that it does not check that the input string is a 
valid infix expression. In fact, it would be instructive for you to examine the operation 
of this program when it is presented with a valid postfix string as input. As an exercise 
you are asked to write a program that checks whether or not an input string is a valid 
infix expression. 

We can now write a program to read an infix string and compute its numerical 
value. If the original string consists of single-digit operands with no letter operands, the 
following program reads the original string and prints its value. 
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‘#define MAXCOLS 80 

void main() 

{ 
char instring[MAXCOLS], postring[MAXCOLS]; 
int position = 0; 
double eval(); 


while((instring[position++] = getchar()) != '\n') 


' 
instring[--position] = '\0'; 
printf("%s%s\n", “infix expresson is ", instring); 
postfix(instring, postring); 
printf("%s%f\n", "value is ", eval(postring)); 
} /* énd main */ i 


~ » Two different versions of the stack manipulation routines (pop, push, and so forth), 
and associated function prototypes, are required because postfix uses a stack of character 
operators (that is, opstk), whereas eval uses a stack of float operands (that is, opndstk). 
Of course, it is possible to use a single stack that can contain both reals or characters 
by defining a union as described earlier in Section 1.3. 

Most of our attention in this section has been devoted to transformations involv- 
ing postfix expressions. An algorithm to convert an infix expression into postfix scans 
characters from left to right, stacking and unstacking as necessary. If it were necessary 
to convert from infix to prefix, the infix string could be scanned from right to left and the 
appropriate symbols entered in the prefix string from right to left. Since most algebraic 
expressions are read from left to right, postfix is a more natural choice. 

The foregoing programs are merely indicative of the type of routines one could 
write to manipulate and evaluate.postfix expressions. They are by no means compre- 

_hensive or unique. There are many variations of the fdregoing routines that are equally 
acceptable. Some of the early high-level language compilers actually used’routines such 
as eval and postfix to handle algebraic expressions. Since that time, more sophisticated 
techniques have been developed to handle these problems. 


Stacks in C++ Using Templates 


There are a number of drawbacks to the solution that-we just presented. First, 
although two stacks are used in the complete solution (a stack of operators in the postfix 
routine and a stack of operands in the eval routine), only a single stack is used at any one 
time. Nevertheless, in the implementation of the solution that we presented, both stacks 
were created and remained in memory throughout the entire program. Second, because 
the stacks are not of the same type, it is necessary to declare them separately. And with 
the separate declarations, it is necessary to provide separate sets of primitive routines 
(that is, push, pop, empty, etc.). This, in turn, implies that when the implementation of 
a stack is to be changed it must be changed for each type of stack that we have created. 

It would be more efficient if we could design a system around a stack of indeter- 
minate type and define the primitive routines on such a stack. We would then create 
and destroy instances of such a stack as necessary. This would eliminate the need for. 
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us to create separate primitive routines, as well as serve the purpose of allowing us to 
destroy a stack when it is no longer needed. : 

The C+ + feature that supports the definition of an object of undetermined type is 
called a template. Using a template allows the programmer to define the features of the 
class, while reserving the option of binding the type of the class to the class itself until a 
class of a particular type is actually needed. The creation of a class of a particular type 
is called instantiation. 

Let us now consider how we could create a template for stacks-and then illustrate 
how this template could be used in the previous example: accepting an infix string, 
converting the infix string to a postfix string ( using a stack of operators), and then eval- 
uating the postfix string (using a stack of operands). We begin by defining the class that 
we shall use (in our example, the stack); however, this definition is parameterized in the 
snese that it depends on the attributes of a specific paramter. We denote this by using 


template <class T> 


as a prefix to the remainder of the definition of the class. This prefix indicates that 
T is a parameter in the subsequent definition and will vary from one use to another. 
The construct that allows us to create a class of (as yet) undetermined type is called a 
template. i = F iii 

Thus the template definition is as follows: 


template <class T> 
// T is of ordinal type 
class Stack { 


private: ‘ . 
int top; // top points to the next top element 
T *nodes; 

public: i 
Stack (); // default constructor 


int empty (void); 
void push(T &); 
T pop(void); 


T pop(int &); // example of overloading pop to 
// handle the functions of popandtest 
“Stack (); // default destructor 


i 


Within the same file, we would also provide the implementation ofthe stack tem- 
plate. For example, the constructor for Stack would be implemented as follows: 


// Implementation of templates 
template <class T> Stack<T>::Stack () 
{ 


top = -1; 


nodes = new T[STACKSIZE]; 
Ji 
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> 


In the example above, the maximum size of the stack is predetermined to be 
STACKSIZE and the top (a private variable) is initialized to —1. An array of nodes 
of type T is created when the stack is instantiated. Thus a stack of integers would result 
in an array of integers, while a stack of double would result in an array of double. Each 


stack of a particular type would be instantiated with an array of that type. A destructor 
for a stack could be implemented by 


template <class T> Stack<T>::~Stack () 
{ = 


A 


-delete nodes; 


The primitive routines empty, push, and pop are straightforward. (Note that 
we also overload the function pop by inċorporating into it the functionality of 
popandtest.) 


template <class T> int Stack<T>::empty (void) 


{ 
return top> =0; 
J; = 
template <class T> void Stack<T>::push(T & j) 
{ $ 
if (top == STACKSIZE) { 
cout << "Stack overflow" .<< endl; 
return; 
} 
nodes [++top] = j; f 
} 
template <class T> T Stack<T>::pop(void) 
{ : 
Tp; ; 
if (empty ()) { 
cout << "Stack underflow" << endl; 
return p; 
} 
p = nodes[top--]; 
return p; 
}; 


// The tasks of this function were formérly performed’ by popandtest 
template <class T> T Stack<T>::pop(int & und) 
{ 

Tp; 
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if (empty ()) { 
und = 1; 
return p; 

} 

und = 0; 

p = nodes[top--]; . 

return p; 


3 


To make use of the stack template, it would be necessary to include all of the 
prototype definitions above in a file as follows: 


// stackt.h 


#ifndef STACKT_H 
@ #define STACKT_H 


#include <stdio.h> 

#include <stdlib.h> 

#include <alloc.h> 

#include <mem.h> = 
#include <iostream.h> 


#define STACKSIZE 100 = 


These statements would be followed by all of the above (template definition and proto- - 
type functions) and ended with 


#endif 


The above constitute a definition for the template stack. 

The meaning of the expression #ifndef... is that if the operand STACKT_H is not 
already defined, it should be defined here; otherwise, the definition may be bypassed. 
This is a precaution against a double definition (which might occur if there was an 
#include within an #include). 

To make use of a stack of a particular type, it would first be necessary to instan- 
tiate it. This is done simply by declaring a stack of a particular type. By declaring such 
a stack, the stack class will create an instance of a stack of the specified type. Thus we 
could build a file that contains the following routines: 


-#define MAXCOLS 52 
void postfix(char *infix, char *postr); 
int prcd(char opl, char op2); 


int isoperand(char op); 
int isoperator(char op); 
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long double eval(char *postr); 
long double oper(int symb, Tong double opl, long double op2); 


int prced(char opl, char op2) 
// body of pred goes here 


int isoperator(char op) 
// body of isoperator goes here 


int isoperand(char op) 
// body of isoperand goes here 


void ‘postfix(char *infix, char *postr) 
{ 


int position, und; 
int outpos=0; 

char topsymb='+'; 
char symb; 
Stack<char> opstk; 


for (position=0; (symb=infix[position]) != '\0'; position++) { 
if_(isoperand(symb)) 
postr[outpos++]=symb; 
else { 
topsymb=opstk.pop(und) ; 
while (‘und && prcd(topsymb, symb)) { 
postr[outpos++] = topsymb; 
topsymb=opstk. pop(und) ; 


} 
if (‘und) 
opstk.push(topsymb) ; 
if (und || (symb != ')')) 
opstk.push(symb) ; 
else 
topsymb=opstk.pop() ; 


3 /* end for */ 
while (!opstk.empty()) 
postr[outpos++]=opstk.pop(); 
postr[outpos]='\0'; 
} /* end postfix */ 


long double oper(int symb, long double opl, long double op2) 
// body of isoperator goes here 


long double eval(char *postr) 

{ 
int c, position; 
long double opnd1, opnd2, value; 
Stack<long double> opndstk; 
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for (position=0; (c=postr[position]) != '\0'; position++) 

if (isoperand(c)) 

opndstk.push((float) (c-'0')); 

else { ' 
opnd2=opndstk. pop(); 
opndl=opndstk.pop(); 
value=oper(c, opndl, opnd2); 
opndstk.push(value); 
} z 

return (opndstk.pop()); 
} /* end eval */ 


Notice that the statements that instantiate the stacks contain the type of the stack as a 
parameter. Thus the statement 


stack<long double> opndstk; 
within the function eval creates a stack of long double, while the statement 
stack<char> opstk; 
within =_— crea:es a stack of type char. As we mentioned earlier, one set of routines 
for a template class stack is sufficient to manipulate a stack of any type. 


Finally, all of the above could be followed by a main routine: 


void main(void) 


èf 
char 1n[250], post[250]; 
long double res; 
cin >> in; 


cout. << in << endl; 
postfix (in, post); 
res = eval (post); 

cout << res << endl; 


Two points should be noted regarding the method that we just presented. First, the 
set of programs above creates two classes of type stack: a stack of char and a stack of 
long buble. The existence of these two classes is based on declarations that use them. 
Thus. since the stack of characters is present in the postfix routine, a class of that type 
will be created; and because a stack of double long is present in the eval routine. a class 
of that type will be created. The existence of these classes is really independent of the 
existence of any objects of-the Nespective types. Of course, we did declare a stack of 
each type. But you should note that the two stack classes exist regardless of whether or 
not the objects of those types are declared (that is. the routines in which they are created 
may never be called). 
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The second point deals with the destructor. Although we defined a default de- 
structor, we never really called it explicitly. This is because the default destructor is 


called automatically when control is returned from the routine in which an object was 
created. 


EXERCISES 


2.3.3. 


2.3.5. 


Transform each of the following expressions to prefix and postfix. 
(a) A+ B-C 

(b) (A+ B)(C — D)SE*F 

(c) (A+ B)(C$(D-E)+F)-G 

(d) A+((B- C)*(D - E) + F)/G)S(H — J) 

Transform each of the following prefix expressions to infix. 

(a) +- ABC 

(b) +A-BC 

(c) ++A—*$BCD/ + EF» GHI 

(d) +- $ABC + D*+ EFG 

Transform each of the following postfix expressions to infix. 

(a) AB+C- 

(b) ABC+- 

(c) AB-C+DEF-+5$ 

(d) ABCDE -+ $=+*EF*+*- 

Apply the evaluation algorithm in the text to evaluate the following postfix expressions. 
Assume A = 1, B=2, C=3. i 

(a) AB+C-BA+C$- 

(b) ABC + CBA- +*+ 


Modify the routine eval to accept as input a character string of operators and operands 
representing a postfix expression and to create the fully parenthesized infix form of 
the original postfix. For example, AB + would be transformed into (A + B) and AB + 
C — would be transformed into (å + B)—- C). ` 


Write a single program combining the features of eval and postfix to evaluate an infix 
string. Use two stacks, one for operands and the other for operators. Do not first convert 


the infix string to postfix and then evaluate the postfix string, but rather evaluate as you 
go along. 


Write a routine prefix to accept an infix string ang create the prefix form of that string, 


assuming that the string is read from right to left and that the prefix string is created 
from right to left. 


' Write a C program to convert 


(a) A prefix string to postfix 
(b) A postfix string to prefix 
(c) A prefix string to infix 
(d) A postfix string to infix 


Write a C routine reduce that accepts an infix string and forms an equivalent infix string 
with all superfluous parentheses removed. Can this be done without using a stack? 
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2.3.10. 


2.3.11. 


2.3.12. 
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Assume a machine that has a single register and six instructions, 


LD A Places the operand A into the register 

ST A Places the contents of the register into the variable A 

AD A Adds the contents of the variable A to the register 

SB A Subtracts the contents of the variable A from the register 
ML A Multiplies the contents of the register by the variable A 
DV A Divides the contents of the register by the variable A 


Write a program that accepts a postfix expression containing single-letter operands 
and the operators +, =, *, and / and prints a sequence of instructions to evaluate the 
expression and leave the result in the register. Use variables of the form TEMPn as 


temporary variables. For example, using the postfix expression ABC * + DE =/ should 
print the following: 


LD B 
ML C 
ST TEMPL 
LD A 
AD TEMPL 
ST . TEMP? 
LD D 
SB E 
ST TEMP3 
LD TEMP2 
DV TEMP3 
ST TEMP4 


The template definition of a stack can be expanded to allow the size of the stack to be 
a parameter as well. Show how to define a stack template in which each instantiation 
of the stack will have both the element type and the size of the stack asa parameter. 


Can a template be used to store elements of different types on the same stack? Why or 
why not? 
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-Recursion 


This chapter introduces recursion, a programming tool that is one of the most power- 
ful and one of the least uriderstood by beginning students of programming. We define 
recursion, introduce its use in C, and present several examples. We also examine an 
implementation of recursion using stacks. Finally, we discuss the advantages and dis- 
advantages of using recursion in problem solving. 


3.1 RECURSIVE DEFINITION AND PROCESSES 


Many objects in mathematics are defined by presenting a process to produce that object. 
For example. 7 is defined as the ratio of the circumference of a circle to its diameter. 
This is equivalent to the following set of instructions: obtain the circumference of a 
circle and its diameter. divide the former by che latter, and call the result 77. Clearly, the 
process specified must terminate with a definite result. 


Factorial Function 
Another example of a definition specitied by a process is that of the factorial 


function, which plays an important role in mathematics and statistics. Given a posi- 
tive integer n. n factorial is defined as the product of all integers between n and |. For 
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example, 5 factorial equals $ * 432l = 120, and 3 factorial equals 3*2* 1 = 6. 
O factorial is defined as 1. In mathematics, the exclamation mark ( ') is often used to 


denote the factorial function. We may therefore write the definition of this function as 
follows: 


nialif ns 9 g 
alen* (n-1) * (n-2)*... “Lif aso 


The three dots are really a shorthand for all the numbers between n — 3 and 2 multiplied. 
together. To avoid this shorthand in the definition of n! we would have to list a formula 
for n! for each value of n separately, as follows: i 


0 a1 

l!=1 

l =2*1 
EE E E 
E a a E 


Of course, we cannot hope to list a formula for the factorial of each integer. To 
avoid any shorthand and to avoid an infinite set of definitions, yet to define the function 


precisely, we may present an algorithm that accepts an integer n and returns the value 
of n!. 


prod = 1; 

for (x = n; x > 0; x--) 
prod *= x; 

return(prod); 


Such an algorithm is called iterative because it calls for the explicit repetition of 
some process until a certain condition is met. This algorithm can be translated readily 
into a C function that returns n! when n is input as a parameter. An algorithm may be 
thought of as a program for an “ideal” machine without any of the practical limitations 
of a real computer and may therefore be used to define a mathematical function. A C 
function, however, cannot serve as the mathematical definition of the factorial function 
because of such limitations as precision and the finite size of a real machine. 

Let us look more closely at the definition of n! that lists a separate formula for 
each value of n. We may note, for example, that 4! equals 4 * 3 * 2 x 1, which equals 
4 * 3!. In fact, for any n > 0, we see that n! equals n * (n — 1)!. Multiplying n by the 
product of all integers from n — 1 to 1 yields the product of all integers from n to 1. We 
may therefore define ` 

0! = 1 

lI! =1* 0! 

2!a2* 1! 

3! 23 * 2! 

4! a4 * 3! 


or, using the mathematical notation used earlier, 
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This definitivi oy appear quite strange, since it defines the factorial function in 
terms of itself. This secms io be a circular definition and totally unacceptable until we 
realize that the mathematica! notation is only a concise way of writing out the infinite 
number of equations necessary to define »! for each n. 0! is defined directly as 1. Once 
O! has been defined, detining 1! as 1 * 0! is not circular at all. Similarly, once 1! has 
been defined, defining 2! as 2 + 1! is equally straightforward. It may be argued that the 
latter notation is more precise than the definition of n! as ni * (A — 1) *---* 1} forn > 0 
because it does not resort to three dots to be filled in by the (it is hoped) logical intuition 
of the readef. Such a definition, which defines an object in terms of a simpler case of 
itself, is called a recursive definition. j | 

Let us see how the recursive definition of the factorial function may be used to 
evaluate 5!. The definition states that 5! equals 5 * 4!. Thus, before we’can evaluate 
5!, we must first evaluate 4!. Using the definition once more, we find that 4! = 
4 * 3!. Therefore. we must evaluate 3!. Repeating this process, we have that 


Each case is reduced to a simpler case until we reach the case of 0!, which is 
detined directly as 1. At line 6 we have a value that is defined directly and not as the 
factorial of another number. We may therefore backtrack from line 6 to line 1, returning 
the value computed in one line to evaluate the result of the previous line. This produces 


{ 
“6 Olel 

“Sibel tOhel*ilel 
4'2!=2ż]!=2*1=2 
3) 3l=3* Qa *226 
2' 442 4% 324% 6 = 24 
1' Sle 5 * 4! = 5 * 24 = 120 


Let us attempt to incorporate this process into an algorithm. Again, we want the 
algorithm to input a nonnegative integer n and to compute in a variable fact the non- 
negative integer that is factorial. 


1 if (n == 0) 

2 fact = 1; 

3 else { 

4 x=n-1l; i 

5 find the value of x!. Call it y; - 
6 fact =n* y; 

7 } /* end else */ 


Sec. 3.1 Recursive Definition and Processes 119 


This algorithm exhibits the process used to compute n! by the recursive definition. 
The key to the algorithm is, of course, line 5, where we are told-to “find the value of x!.” 
This requires reexecuting the algorithm with input x, since the method for computing 
the factorial function is the algorithm itself. To see that the algorithm eventually halts, 
note that, at the start of line: 5, x equdls n — |. Each time the algorithm is executed, 
its input is one-less than the preceding time, so that (since the Original input n was a 
nonnegative integer)-0 is eventually input to the algorithm. At that point, the algorithm 
simply returns 1. This value is returned to line 5, which asked for the evaluation of 
0!. The multiplication ef Y (which-equals 1) by n (which equals 1) is then executed 
and the result. is returned, This sequence of inultiplications and returns continues until 
‘the original n! has been evaluated, In the next section we will see how to convert this 
algorithm into a C program. J eT 
_ Of course, it is much simpler and more straightforward to use the iterative method 
for evaluation of the factorial function. We present the recursive method as a simple 
example to introduce recursion, not as a’ mort effective method of solving this particular 
problem. Indeed, all the problems in this section can'be solved more efficiently by 
iteration. However, later in this chapter and in subsequent chapters, we will come across 
eaamples that are. more easily solved by recursive methods, 


Multiplication of Natural Numbers 


Another example of a recursive definition is the definition of multiplication of 
natural numbers. The product a * b, where a and b are positive integers, may be defined 
as a added to itself b times. This is an iterative definition. An equivalent recursive . 
definition is i 


*b=aif b= 1 
*b=a*(5-1)¥řžaifb>i 


To evaluate 6 « 3 by this definition. we first evaluate 6 * 2 and then add 6. To 
evaluate 6 * 2, we first evaluate 6 * | and add 6. But 6 * | equals 6 by the first part of 
the definition. Thus 


6*3=642+65641+64+6 264646238 


The reader is urged to convert the definition above to a recursive algorithm as a simple 
exercise. 

Note the pattern that exists in recursive definitions. A simple case of the term to 
be défined is defined explicitly (in the case of factorial, 0! was defined as 1; in the case 
of multiplication, a * 1 = a). The other cases are defined by applying some Operation 
to the result of evaluating a simpler case. Thus n! is defined in terms of (n — 1)! and 
a * b in terms of a * (b — 1). Successive simplifications of any particular case must 
eventually lead to the explicitly defined trivial case. In the case of the factorial function, 
successively subtracting | from n eventually yields 0. In the case of multiplication, 
successively subtracting 1 from b eventually yields il. If this were not the case, the 
definition would be invalid. For example, if we defined 
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n! = (n+ 1)!/(n + 1) 
or ; js 


a*b=a* (b+1)-a 
we would be unable to determine the value of 5! or 6 * 3. (You are invited to attempt 
to determine these values using the foregoing definitions.) This is true-despite the fact 
that the two equations are valid. Continually adding one to n or b does not eventually , 


-produce an explicitly defined case. Even if 100! was defined explicitly, how'could the 
value of 101! be determined? 


FibonagplReappnae:: siting rij WA gay } 


. Let us examine.a less. fadeilien example. The Fébowteai sequence is ie — 


of i integers aali 6 si 


ie AELS E 13, 21,34, 20 


Each element in this sequence is thes sum ps the two preceding elements (for example, 
0+1 = 1,141 =2,14+2 = 3,2+3 = ..). If we let fib(O) = 0, fib(1) = 1, and so 
on, then we may define the Fibonace’ sequence by the following recursive definition: 


Fib(n) = n if n == 0 or ne= 1 
fib(n) = Fib(n - 2) + fib(n - dif n>=2 


To compute fib(6), for example, we may apply. the definition recursively to obtain 


fib(6) = fib(4) + Fib(S) = fib(2) + fib(3) + fib(S)= | 
Fib(0) + fib(1) + fib(3) + fib(S) = 0 + 1+ fib(3) + fib(S) = i 
1 + fib(1) + Fib(2) + fib(5) = 

141+ fib(0) + fib(1) + Fib(5) = ` 

2+ 0+1 fib(5) = 3 + fib(3) + Fib(4) = 

3 + fib(1) + fib(2) + fib(4) = 

3 + 1+ fib(0) + fib(1) + fib(4) = 

4+0 +14 fib(2) + fib(3) = 5 + fib(0) + fFib(1) + fib(3) = 
5+ 0+ 1+ fib(1) + fib(2) = 6+ 1+ Fib(0) + fib(1) = 
7+04+1=8 


Notice that the recursive definition of the Fibonacci numbers differs from the re- 
cursive definitions of the factorial function and multiplication. The recursive definition 
of fib refers to itself twice. For example, fib(6) = fib(4) + fib(5), so that in computing 
fib(6), fib must be applied recursively twice. However, the computation of fib(5) also 
involves determining fib(4), so that a greàt deal of computational redundancy occurs 
in applying the definition. In the foregoing example, fib(3) is computed three separate 
times. lt would*be much more efficient to “remember” the value of fib(3) the first time 
that it is evaluated and reuse it each time that it is needed. An iterative method of com- 
puting fib(n) such as the following is much more efficient: 
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if (n <= 1) 
return(n); 
lofib = 0; 
hifib = 1; 
for (i = 2; i <= n; i++) { 
x = lofib; 
lofib = hifib; 
hifib = x + lofib; 
} /* end for */ 
return(hifib); 


Compare the number of additions (not including increments of the index variable 
i) that are performed in computing fib(6) by this. algorithm and by using the recursive 
definition. In the case of the factorial function, the same number of multiplications must 
be performed in computing n! by the recursive and iterative methods. The same is true 
of the number of additions in the two methods of computing multiplication. However, 
in the case of the Fibonacci numbers, the recursive method is far more expensive than 


the iterative. We shall have more to say about the relative merits of the two methods in 
a later section. 


Binary Search 


You may have received the erroneous impression that recursion is a very handy 
tool for defining mathematical functions but has_no influence in more practical com- 
puting activities. The next example illustrates an application of recursion to one of the 
most common activities in computing: that of searching. < 

Consider an array of elements in which objects have been placed in some order. 
For example, a dictionary or telephone book may be thought of as an array whose entries 
are in alphabetical order. A company payroll file may be in the order of employees’ 
social security numbers. Suppose that such an array exists and that we wish to find a 
particular element in it. For example, we wish to look up a name in a telephone book, 
a word in a dictionary, or a particuiar employee in a personnel file. The process used to 
find such an entry is called a search. 

Since searching is such a common activity in computing, it is desirable to find an 
efficient method for performing it. Perhaps the crudest search method is the sequential 
or linear search, in which each item of the array is examined in turn and compared with 
the item being searched for until a match occurs. If the list is unordered and haphazardly 
constructed, the linear search may be the only way to firid anything in it (unless, of 
course, the list is first rearranged). However, such a method would never be used in 
looking up a name in a telephone book. Rather, the book is opened to a random page 
and the names on that page are examined. Since the names are ordered alphabetically, 
such an €xamination would determine whether the search should proceed in the first or 
second half of the book. 

Let us apply this idea to searching an array. If the array contains only one element, 
the problem is trivial. Otherwise, compare the item being searched for with the item 
at the middle of the array. If they are equal, the search has been completed successfully. 
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If the middle element is greater than the item being searched for, the search process is 
repeated in the first half of the array {since if the item appears anywhere it must appear 
in the first half); otherwise, the process is repeated in the second half. Note that each 
time a comparison is made, the number of clements yet to be searched is cut in half. For 
large arrays, this method is superior to the sequential search in which each comparison 
reduces the number of elements yet to be searched by only one. Because of the division 
of the array to be searched into two equal parts, this search method is called the binary 
search. 

Notice that we have quite naturally defined a binary search recursively. If the item 
being searched for is not equal to the middle element of the array, the instructions are to 
search a subarray using the same method. Thus the search method is defined in terms of 
itself with a smaller array as input. We are sure that the process will terminate because 
the input arrays become smaller and smaller, and the search of a one-element array is 
defined nonrecursively, since the middle element of such an array is its only element. 

We now present a recursive algorithm to search a sorted array a for an element 
x between aflow] and a{high). The algorithm returns an index of a such that alindex] 
equals x if such an index exists between low and high. If x is not found in that portion 
of the array, binsrch returns —} (in C, no element af — 1] can exist). 


1 if (low > high) 

2 return(-1); 

3 mid = (low + high) / 2; 

4 if (x == a{mid)) 

5 return(mid); 

6 if (x < a[mid}) 

7 search for x in a[low] to a[mid - 1); 
8 else 

9 search for x in a[mid + 1] to afhigh); 


Since the possibility of an unsuccessful search is included (that is, the element 
may not exist in the array), the trivial case.has been altered somewhat. A search on a 
one-element array is not defined directly as the appropriate index. Instead that element 
is compared with the item being searched for. If the two items are not equal, the search 
continues in the “first” or “second” half—each of which contains no elements. This 
case is indicated by the condition low > high, and its result is defined directly as — 1. 

Let us apply this algorithm to an example. Suppose that the array a centains the 
elements 1, 3. 4, 5, 17, 18, 31, 33, in that order, and that we wish to search for 17 (that 
is, x equals 17) between item 0 and item 7 (that is, Jow is 0. high is 7). Applying the 


algorithm, we have 
Line 1: Is low > high? It is not, so execute line 3. 
Line 3: mid = (0 + 7)/2 = 3. 
Line 4: Is x == a[3]? 17 is not equal to 5. so execute line 6. 
Line 6: Is x < a[3]? 17 is not less than 5, so perform the else clause at line 8. 
Line 9: Repeat the algorithm with ow = mid + 1 = 4 and high = high = 7: 
i.e., Search the upper half of the array. 
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Line 1: 
Line 3: 
Line 4: 
Line 6: 


Line 7: 


Line 1: 
Line 3: 
Line 4: 


Is 4 > 7? No, so execute line 3. 

mid = (4+ 7)/2 = 5. 

Isx == a[5]? 17 does not equal 18, so execute line 6. 

Is x < a[5]? Yes, since 17 < 18, so search for x in a{low] to 

a|mid — 1]. TS 

Repeat the algorithm with low = low = 4 and high = mid — 1 = 4. 
We have isolated x between the fourth and the fourth elements of a. 
Is 4 > 4? No, so execute line 3. 

mid = (4 + 4)2 = 4. 


Since a[4] == 17, return mid = 4 as the answer. 17 is indeed the 
fourth element of the array. 


Note the pattern of calls to and returns from the algorithm. A diagram tracing this 
pattern appears in Figure 3.1.1. The solid arrows indicate the flow of control through 
the algarithm and the recursive calls. The dotted lines indicate returns. Since there are 
no steps to be executed in the algorithm after line 7 or 9, the returned result is returned 
intact to the previous execution. Finally, when control returns to the.original execution. 
the answer is returned to the caller. 

Let us examine how the algorithm searches for an item that does not appear in the 


array. Assume the array a as in the previous example and assume that it is searching 
for x, which equals 2. 


Inn 


Answer 


(Answer is found) 


Figure 3.1.1 Diagrammatic representation of the binary search algorithm. 
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Line 1: Is low > high? 0 is not greater than 7, so execute line 3. 

Line 3: mid = (0 + 7)/2 = 3. 

Line 4: Is x == a[3]? 2 does not equal 5, so execute line 6. 

Line 6: Is x < a[3]? Yes, 2 < 5, so search for x in a{low] to a[mid — 1}. 

Line 7: Repeat the algorithm with low = low = O and high = mid — 1 = 2. 

: If 2 appears in the array, it must appear between a[0] and a[2) inclusive. 

Line 1: Is 0 > 2? No, execute line 3. 

Line 3: mid = (0 + 2)/2 = 1. 

Line’4: Is 2 == a{1]? No, execute line 6.. oA 

Line 6: Is 2 < aļ| 1]? Yes, since 2 < 3. Search for x in allow] to a\mid — 1}. 

Line 7: Repeat the algorithm with low = low = 0 and high = mid — 1 = 0. 
If x exists in a it must -be the first element. 

Line 1: Is 0 > 0? No, execute line 3. 

Line 3: mid = (0 + 0)/2 = 0. 

Line 4: Is 2 == a[0]? No, execute line 6. : 

Line 6: Is 2 < a{0}? 2 is not Jess than 1, so perform the else clause at line 8. 

Line 9: Repeat the algorithm with low = mid + 1 = 1 and high = high = 0. 


Line 1: Is low > high? 2 is greater than 1, so ~ is returned. The item 2 does 
not exist in the array. 


Properties of Recursive Definitions or Algorithms 


Let us summarize what is involved in a recursive definition or algorithm. One 
important requirement for a recursive algorithm to be correct is that it not generate 
an infinite sequence of calls on itself. Clearly, any algorithm that does generate such 
a sequence can never terminate. For at least one argument or group of arguments, a 
recursive function f must be defined in terms that do not involve f. There must be a “way 
out” of the sequence of recursive calls. In the examples of this section the nonrecursive 
portions of the definitions were 


factorial: Of =1 
multiplication: a* l=a 
Fibonacci seq.: fib(0) = 0; fib(1) =1 
binary search: if (low > high) 
: return(-1); 
if (x == a[mid]) 
return(mid); 


Without such a nonrecursive exit, no-recursive function can ever be computed. Any 


instance of a recursive definition or invocation of a recursive algorithm must eventually 
reduce to some manipulation of one or more simple, nonrecursive cases. 
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Write an iterative algorithm to evaluate a * b by using addition, where u and b are 
nonnegative integers. 

Write a recursive definition of a + b, where a and b are nonncgative integers, in terms 
of the successor function succ, defined as 


succ(x) 
int x; 


{ 
return(x++); 
} /* end succ */ 


Let a be an array of integers. Present recursive algorithms to compute: 
(a) The maximum element of the array 

(b) The minimum element of the array 

ic) The sum of the elements of the array 

td) The product of the elements of the array 

(e) The average of the elements of the array 


Evaluate each of the following. using both the iterative and recursive definitions. 
(a) 6! ` 


(b) 9! 

(c) 100 3 
(d) 6*4 

(e) fib(10) 
(f) fib(11) 


Assume that an array of ten integers contains the elements 
1, 3, 7, 15, 21, 22, 36, 78, 95, 106 


Use the recursive binary search to find each of the following items in the array. 
(a) | 

(b) 20 

(c) 36 


Write an iterative version of the binary search algorithm. (Hint: Modify the values of 
low and high directly.) 


Ackerman’s function is defined recursively on the nonnegative integers as follows: 


a(m,n) = n+] ifm == 
a(m,n) = a(m — 1, 1) ifm! = 0,n == 
a(m.n) = a(m — 1, a(m, n — 1)) if m! = 0,n! =0 


(a) Using the above definition, show that a(2,2) equals 7. 


(h} Prove that a(m.n) is defined for all nonnegative integers m and n. 
(c) Can you find an iterative method of computing a(m,n)? 


Count the number of additions necessary to compute fib(n) for O <= n <= 10 by the 
iterative and recursive methods, Does a pattern emerge? 


If an array contains n elements, what is the maximum number of recursive calls made 
by the binary search algorithm? 
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3.2 RECURSION IN C 
Factorial in C 


The C language allows a programmer to write subroutines and be ae that call 
themselves. Such routines are called recursive. 


The recursive algorithm to compute n! may be directly SEL RN into aC function 
as follows: 


int fact(int n) 
{ ; 
int x, y; 


if (n == 0) 
return(1); 
x = n-l; 
y = fact(x); i 
return(n.* y); t 
} /* end fact */ 


In the statement y = fact(x); the function fact calls itself. This is the essential 
ingredient of a recursive routine. The programmer assumes that the function being com- 
puted has already been written and uses it in its own definition. However, the program- 
mer must ensure that this does not lead to an endless series of calls. 

Let us examine the execution of this function when it is called by another program. 
For example, suppose that the calling program contains the statement 


printf("%d", fact(4)); 


When the calling routine calls fact, the parameter n is set equal to 4. Since n is not 0, 
x is set equal to 3. At that point, fact is called a second time with an argument of 3. 
Therefore, the function fact is reentered and the local variables (x and y) and parameter 
(n) of the block are reallocated. Since execution has not yet left the first call of fact, 
the first allocation of these variables remains. Thus there are two generations of each of 
these variables in existence simultaneously. From any point within the second execution 
of fact, only the most recent copy of these variables can be referenced. 

In general, each time the function fact is entered recursively, a new set of local 
variables and parameters is allocated, and only this new set may be referenced within 
that call of fact. When a return from fact to a point in a previous call takes place, the 
most recent allocation of these variables is freed, and the previous copy is reactivated. 
This previous copy is the one that was allocated upon the original entry to the previous 
call and is local te that call. i 

This description suggests the use of a stack to keep the successive generations 
of local variables and parameters.: This stack is maintained by the C system and is 
invisible to the user. Each time that a recursive function is entered, a new allocation of 
its variables is pushed on top of the stack. Any reference to a local variable or parame- 
ter is through the current top of the stack. When the function returns, the stack is popped, 
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(a) (Initially). (b) fact (4). (c) fact (3), (d) fact (2). 
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(i), v = faer (2) G) v = fact (3). © (k) printf (d, faci (4)). 


Figure 3.2.1, Stack at various times during execution. (An asterisk indicates 
an uninitialized value.) : 


the top allocation is freed, and the previous allocation becomes the current stack top to 
be used for referencing local variables. This mechanism is examined more closely in 
Section 3.4, but for now, let us see how it is applied in computing the factorial function. 

Figure 3.2.1 contains a series of snapshots of the stacks for the variables n, x, and 
y as execution of the fact function proceeds. Initially, the stacks are empty, as illustrated 
by Figure 3.2.1a. After the first call on fact by the calling procedure, the situation is as 
shown in Figure 3.2.1b, with » equal to 4. The variables x and y are allocated but not 
initialized. Since n does not equal 0, x is set to 3 and fact(3) is called (Figure 3.2.1c). 
The new value of n does not equal 0; therefore x is set to 2 and fact(2) is called (Figure 
3.2.1d). , 

This continues until n equals 0 (Figure 3.2.1f). At that point the value 1 is returned 
from the call to fact(0). Execution resumes from the point at which fact(0) was called, 
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which is the assignment of the returned value tọ the copy of y declared in fact( 1). This 
is illustrated by the status of the stack Shown in Figure 3.2.1g, where the variables 
allocated for fact(0) have been freed and y is set to 1. 

The statement return(n * y) is then executed, multiplying the top values of n 
and y to obtain 1 and returning this value to fact(2) (Figure 3.2.1h): This process is 
repeated twice more, until finally the value of y in fact(4) equals 6 (Figure 3.2.1)). The 
Statement return(n * y) is executed one more time. The product 24 is returned to the 
calling procedure where it is printed by the statement 


printf("%d", fact(4)); 


Note that each time that a recursive routine returns, it returns to the point imme- 
diately following the point from which it was called. Thus, the recursive call to Sfact(3) 
returns to the assignment of the result to X within fact(4), but the recursive call to fact) 
returns to the printf statement in the calling routine. 

Let us transform some of the other recursive definitions and processes of the pre- 
vious section into recursive C programs. It is difficult to conceive of a C programmer 
writing a function to compute the product of two positive integers in terms of addition, 
since an asterisk performs the multiplication directly. Nevertheless, such a function 
can serve as another illustration of recursion ‘in C. Following closely the detinition of 
multipticatio. in the previous section, we may write: 


int mult(int a, int b) 


return(b == 1 ? a: mult(a, b-1) 4 a);__ - 
} /* end mult */ 


Notice how similar this Program is to the recursive definition of the last section. We 
leave it as an exercise for you to trace through the execution of this function when it is 
called with two positive integers. The use of stacks is a great aid in this tracing process. 

This example illustrates that a recursive function may invoke itself even within 
a statement assigning a value to the function. Similarly, we could have written the re- 
cursive fact function more compactly as 


int fact(int n) 
{ 


return(n == 0? 1; n+ fact(n-1)); 
} /* end fact */ 


This compact version avoids the explicit use of local variables x (to hold the value 
of n — 1) and y (to hold the value of Jact(x)). However, temporary locations are set aside 
anyway for these two values upon each invocation of the function. These temporaries 
are treated just as any explicit local variable. Thus, in tracing the action of a recursive 


routine, it may be helpful to declare all temporary variables explicitly. See if it is any 
easier to trace the following more explicit version of mult: 
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int mult(int a, int b) 
{ 


int c, d, sum; 


if (b == 1) 
return(a); 
c = b-1; 
d = mult(a, c); 
sum = d+a; 
return(sum) ; 
} /* end mult */ 


Another point that should be made is that it is particularly important to check for 
the validity of input parameters in a recursive routine. For example, let us examine the 
execution of the fact function when it is invoked by a statement such as 


printf("\n%d", fact(-1)); 


, 


Of course, the fact function is not designed to produce a meaningful result for negative 
input. However, one of the most important things for a programrher to learn is that 
a function invariably will be presented at some time with invalid input and, unless 
provision is made for such input, the resultant error may be very difficult to trace. 

For example, when — | is passed as a parameter to fact, so that n equals — 1, x is set 
to —2 and —2 is passed to a recursive call on fact. Another set of n, x, and y is allocated, 
nis set to —2, and x becomes —3. This process continues until the program either runs 
out of time or space or the value of x becomes too small. No message indicating the true 
cause of the error is produced. 

If fact were originally called with a complicated expression as its argument and 
the expression erroneously evaluated to a negative number, a programmer might spend 
hours searching for the cause of the error. The problem can be remedied by revising the 
fact function to check its input explicitly, as follows: 


int fact(int n) 
{ 


int x, y; g 


if (n <0) { 
printf("%s", "negative parameter in the factorial function"); 
exit(1); 

} /* end if */ 

if (n == 0) 
return(1); 

x = n-l; 

y = fact(x); 

return(n * y); 

3 /* „end fact */ 
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Similarly, the function mult must guard against a nonpositive value in the second pa- 
rameter. 


Fibonacci Numbers in C 


We now turn our attention to the Fibonacci sequence. A C program to compute 
the nth Fibonacci number can be modeled closely after the recursive definition: 


int fib(int n) 
{ 
int x, y; 


if (n <= 1) 
return(n); 
x = fib(n-1); 
y = fib(n-2); 
return(x + y); 
} /* end fib */ 


Let us trace through the action of this function in computing the sixth Fibonacci number. 
You may compare the action of the routine with the manual computation we performed 
in the last section to compute fib(6). The stacking process is illustrated in Figure 3.2.2. 
When the program is first called, the variables n, x, and y are allocated, and n is set to 
6 (Figure 3.2.2a). Since n > 1, n — 1 is evaluated and fib is called recursively. A new 
~ „set of n, x, and y is allocated, and n is set to 5 (Figure 3.2.2b). This process continues 
(Figure 3.2.2c-f) with each successive value of n being one less than its predecessor, 
until fib is called with n equal to 1. The sixth call to fib returns | to its caller, so that the 
fifth allocation of x is set to 1 (Figure 3.2.2g). 

The next sequential statement, » = fib(n — 2), is then executed. The value of n 
that is used is the most recently allocated one, which is 2. Thus we again call on fib 
with an argument of 0 (Figure 3.2.2h). The value of 0 is immediately returned, so that 
Ý in fib(2) is set to 0 (Figure 3.2.2i). Note that each recursive call results in a return to 
the point.of call, so that the call of fib(1) returns to the assignment to-x, and the call of 
fib(0) returns to the assignment to y. The next statement to be executed in fib(2) is the 
statement that returns x + y = 1 + 0 = 1 to the statement that calls fib(2) in the gen- 
eration of the function calculating fib(3). This is the assignment to x, so that x in fib(3) 
is given the value fib(2) = 1 (Figure 3.2.2j). The process of calling and pushing and 
returning and popping continues until finally the routine returns for the last time to the 
main program with the value 8. Figure 3.2.2 shows the stack up to the point where fib(5) 
calls on fib(3), so that its value can be assigned to y. The reader is urged to complete 
the picture by drawing the stack states for the remainder of the program execution. 

This program illustrates that a recursive routine may call itself a number of times 
with different arguments. In fact, as long as a recursive routine uses only local variables, 
the programmer can use the routine just as he or she uses any other and assume that it 
performs its function and produces the desired value. He or she need not worry about 
the underlying stacking mechanism. 
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fl) (m) (n) fo) 


(p) q) tn 
ie Figure 3.2.2 The recursion stack of the Fibonacci function. 
Binary Search inc 
Let us now present a C program for the binary search. A function to do this accepts 


an array a and an element xas input and returns the index i in a such that ali] equals 


x, or —1 if no such i exists. Thus the function binsrch might be invoked in a statement 
such as 


i= binsrch(a, x) 
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However, in looking at the binary search algorithm of Section 3.1 as a model for a 
recursive C routine, we note that two other parameters are passed in the recursive calls. 


int binsrch(int a[], int x, int low, int high) 
{ 
‘int mid; 


if Clow > high) 
- return(-1); 
mid = (low + high) / 2; 
return(x == a[mid] ? mid: x< a[mid] ? use 
binsrch(a, x, low, mid-1) : 
binsrch(a, x, mid+1, high)); 
} /* end binsrch */ 


When binsrch is first called from another routine to search for x in an array de- 
clared by i i 


int a[ARRAYSIZE] 
of which the first n elements are occupied, it is called by the statement 


i = binsrch(a, x, 0, n-1); 


_ You are urged to trace the execution of this routine and. follow the stacking and 
unstacking using the example of the preceding section, where a is an array of 8 elements 
(n = 8) containing 1, 3, 4, 5, 17, 18, 31. 33, in that order, The value being searched for 
is 17 (x equals 17). Note that the array a is stacked for each recursive call. The values 
of low and high are the lower and upper bounds of the array a, respectively. 

In the course of tracing through the binsrch routine. you may have noticed that 
the values of the two parameters a and x do not change throughout its execution. 
Each time that binsrch is called the Same array is searched for the same element; 
it is only the upper and lower bounds of the search that change. It therefore seems 
wasteful to stack and unstack these two Parameters each time the routine is called 
recursively. 

One solution is to allow a and x to be global variables, declared before the program 
by 


int a[ARRAYSIZE]; 
int x; y 


The routine is called by a statement stich as 


i = binsrch(0, n-1) 
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In this case, all references to a and x are to the global allocations of a and x declared at 
the beginning of the source file. This enables binsrch to access a and x without allocat- 
ing additional space for them. All multiple allocations and freeings of space for these 
parameters are eliminated. 

We may rewrite the binsrch function as follows: 


int binsrch(int low, int high) 
{ 
int mid; 


if (low > high) 
return(-1);. 
mid = (low + high) / 2; 
return (x == a[mid] ? mid : x < a[mid] ? binsrch(low, mid-1) : 


binsrch(mid+1, high)); 
} /* end binsrch */ 


Using this scheme, the variables a and x are referenced with the extern attribute and are- 
not passed with each recursive call to binsrch. a and x do not change their values and are 
not stacked. The programmer wishing to make use of binsrch in a program only needs 


to pass the parameters low and high. The routine could be invoked with a statement 
such as i 


i = binsrch(low, high); 
Recursive Chains 


A recursive function need not call itself See Rather, it may call itself indi- . 
rectly, as in the following exaniple: 


a(formal parameters) b(formal parameters) 
{ 

b(ar guments) ; a(argunents) . 
} /tend at/ } /tend b*/ 


In this example function a calls b, which may in turn call a, which may again call b. 
Thus both a and b are recursive, since they indirectly call on themselves. However, 
the fact that they are recursive is not evident from examining the body of either of the 
routines individually. The routine a seems to be calling a separate routine b and it is 
impossibie to determine, by examining a alone, that it may call itself indirectly. 

More than two routines may participate in a recursive chain. Thus a routine a 
may call b which calls c, ... which calls z, which calls a. Each routine in the chain may 
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potentially call itself and is therefore recursive. Of course, the programmer must ensure 
that such a program does not generate an infinite sequence of recursive calls, 


Recursive Definition of Algebraic Expressions 


As an example of a recursive chain, consider the following recursive group of 
definitions: 


1. An expression is a term followed by a plus sign followed by a term, or a term 
alone. 


2. A term is a factor followed by an asterisk followed by a factor, or a factor alone. 
3. A factor is either a letter or an expression enclosed in parentheses. 


Before looking at some examples, note that none of the foregoing three items is 
defined directly in terms of itself. However, each is defined in terms of itself indirectly. 
An expression is defined in terms of a term, a term in terms of a factor, and a factor in 
terms of an expression. Similarly, a factor is defined in terms of an expression, which 
is defined in terms of a term, which is defined in terms of a factor. Thus the entire set 
of definitions forms a recursive chain. 

Let us now give some examples. The simplest form of a factor is a letter. Thus 
A, B, C, Q, Z, M are all factors. They are also terms, since a term may be a factor 
alone. They are also expressions, since an expression may be a term alone. Since A is 
an expression, (A) is a factor and therefore a term as well as an expression. A + B is 
an example of an expression that is neither a term nor a factor. (A + B), however, is all 
three. A * B is a term and therefore an expression, but it is not a factor. A * B + C is an 
expression that is neither a term nor a factor. A * (B + C) is aterm and an expression 
but not a factor, - ~- a =Š r we 

Each of the foregoing examples is a valid expression. This can be shown by ap- 
plying the definition of an expression to each of them. Consider, however, the string 
A + *B. It is neither an expression, term, nor factor. It would be instructive for you 
to attempt to apply the definitions of expression, term, and factor to see that none of 
them describe the string A + *B. Similarly, (A + B*)C and A + B + C are not valid 
expressions according to the preceding definitions. 

Let us write a program that reads and prints a character string and then prints 
“valid” if it is a valid expression and “invalid” if it is not. We use three functions to 
recognize expressions, terms, and factors, respectively. First, however, we present an 
auxiliary function getsymb that operates on three parameters: str, length, and ppos. str 
contains the input character string. length represents the number of characters in str. 
ppos points to an integer pos whose value is the position in str from which we last 
obtained a character. If pos < length, getsymb returns the character str[pos] and incre- 
ments pos by 1. If pos > = length, getsymb returns a blank. 


int getsymb(char str[], int length, int *ppos) 
{ 


char c; 
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if (*ppos < length) 
c = str[*ppos]; 


else 
Guts 

(*ppos)++; 

return(c); 


} /* end getsymb */ 


The function that recognizes an expression is called expr. It returns TRUE (or 1) 
if a valid expression begins at position pos of str and FALSE (or 0) otherwise. It also 
resets pos to the position following the longest expression it can find. We also assume a 
function readstr that reads a string of characters, placing the string in str and its length 


in length. 


Having described the functions expr and readstr, we can write the main routine 
as follows. The standard library ctype.h includes a function isalpha called by one of the 


functions below, 


#include <stdio.h> 
#include <ctype.h> 


#define TRUE 1 
#define FALSE 0 


#define MAXSTRINGSIZE 100 


void readstr(char *, int); 


‘int expr(char *, 
int term(char *, 


int, int *); 
int, int *); 


‘int getsymb(char *, int, int *); 
int factor(char *, int, int *); 


void main() > 


{ 


char str [MAXSTRINGSIZE] ; 
int length, pos; 


readstr(str, &length); 


pos = 0; 


if (expr(str, length, &pos) == TRUE & pos >= length) 
_ printf("%s", "“valid"); 


i else 


r 


} /* end main */ 
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printf("%s", "invalid"); 
/* The condition can fail for one (or both) of two 
/* reasons. If expr(str, length, &pos) == FALSE 
/* then there is no valid expression beginning at 
/* pos. If pos < length there may be a valid 
& expression starting at pos but it does not 
occupy the entire string. 


ay 
KA 
*/ 
*/ 
37 
*/ 
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The functions factor and term are much like expr except that they are responsible 
for recognizing factors and terms, respectively. They also reposition pos to the position 
following the longest factor or term within the string str that they can find. + 

The code for these routines adheres closely to the definitions given earlier. Each 
of the routines attempts to satisfy one of the criteria forthe entity being recognized. If 
one of these criteria is satisfied. TRUE is returned. If none of these criteria are satistied, 
FALSE is returned. 


int expr(char str[], int length, int *ppos) 


{ 
[= look for a term j */ 
if (term(str, length, pos) == FALSE) 
return(FALSE) ; 
ft We have found a term; look at the */ 
/* next symbol. / 


if (getsymb(str, length, ppos) != '+') { 
/* We have found the longest expression */ 
/* (a single term). Reposition pos so it */ 


/* refers to the last position of +/ 
7 the expression. a/ 
(*ppos)--; 

return(TRUE) ; 


} /* end if */ 
/* At this point, we have found a term anda */ 
/* plus sign. We must look for another term. */ 
return(term(str, length, ppos)); 

} /* end expr */ 


The routine ferm that recognizes terms is very similar, and we present it without 
comments. 


int term(char-str[], int length, int *ppos) 


if (factor(str, length, ppos) == FALSE) 
return(FALSE) ; 

if (getsymb(str, length, ppos) != '*') { 
(*ppos)--; 
return(TRUE) ; 

} /* end if */ 

return(factor(str, length, ppos)); 

} /* end term */ 


The function factor recognizes factors and should now be fairly straightforward. 
It uses the common library routine isalpha (this function is contained in the library 


ctype.h), which returns nonzero if its character parameter is a letter and zero (or FALSE) 
otherwise. 
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int factor(char str[], int length, int *ppos) 
{ 
int c; 


if ((c = getsymb(str, length, ppos)) != '(') 
return(isalpha(c)); 
return(expr(str, length, ppos) & 
getsymb(str, length, ppos) == ')'); 
} /* end factor */ 


All three routines are recursive, since each may call itself indirectly. For example, 
if you trace through the actions of the program for the input string 


“(a*b +c*d) + (e*(f) + g),” you will find that each of the routines expr; term, and 
factor calls on itself. 


EXERCISES 


3.2.1. Determine what the following recursive C function computes. Write an iterative function 
to accomplish the same purpose. = 


int func(int n) 


if (n == 0) 
return(0); 
return(n + func(n-1)); 
} /* end func */ 


3.2.2. The C expression m % n yields the remainder of m upon division by n. Define the greatest 
common divisor (GCD) of two integers x and y by 


gcd(x,y) = y if (y <= x & x% y == 0) 
gcd(x,y) = gcd(y,x) if (x < y) 
gcd(x,y) = gcd(y, x % y) otherwise 


Write a recursive C function to compute ged(.x,y). Find an iterative method for computing 
this function. 

Let comm(n,k) represent the number of different committees of k people that can be 
formed. given n people from whom to choose. For example, comm(4,3) = 4, since given 
four people A. B, C. and D there are four possible three-person committees: ABC, ABD, 
ACD, and BCD. Prove the identity: 


oy 
io 
w 


comm(n,k) = comm(n - 1,k) + comm(n - 1,k - 1) 


Write and test a recursive C program to compute comm(n,k) forn, k >= 1. 
3.2.4. Detine a generalized fibonacci sequence of f 0 and f | as the sequence gfib( f 0, f 1,0). 
ghh( fO.f 1). g fib(f 0.1.2). 2... where 
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3.2.5. 


3.2.6. 


gfib(f, fl, 0) = 70 
gfib(f0, fl, 1) = fl- 
gfib(f0, fl, n) = gfib(f0, fl, a - 1) 
+ gfib(fd, fl, n+ 2) ifnel 


Write a recursive C function to compute gfib(/0, f1,n). Find an iterative method for 
computing this function. ji 

Write a recursive C function to compute the number of sequences of n binary digits that 
do not contain two Is in a row. (Hint: Compute how many such sequences exist that start 
with 0, and how many exist that start with a I.) 


An order n matrix is an n X n array of numbers. For example, 


(3) 
isa 1X | matrix, 
1 3 
-2 8 
is a2 x 2 matrix and 
l 2 4 6 
2'-5 0 8 
3 7 6 4 
2 0 9 -i 


is a 4 X 4 matrix. Define the mnor of an element x in a matrix as the submatrix formed 
by deleting the row and column containing x. In the preceding example of a 4 X 4 matrix., 
the minor of the element 7 is the 3 X 3 matrix 


gye = 
2 %-& 
2 9 -I 


Clearly the order of a minor of any element is 1 less than the order of the original matrix. 
Denote the minor of an element afi, j] by minor(a{i,j]). A 


Define the determinant of a matrix u (written det(a)) recursively as follows: 
1. Ifaisal X 1 matrix (x), dea) = x. 
2. If ais of an order greater than 1, compute the determinant of u as follows: 


(a) Choose any row or column. For each element a{i,/] in this row or column form the 
product 


power(-1,i + j) * ali,j] * det(minor(aCi,j1)) 


where i and j are the row and column positions cf the element chosen, a{i,/] is the 
element chosen, det(minor(a{i,j])) is the determinant of the minor of alij), and 
power(m,n) is the value of m raised to the nth power. 

(b) det(a) = sum of all these products. 
(More concisely, if n is the order of a, 
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det(a) = X` power(-1, i + j) * ali,j] * det(minor(ali,j})), for any j 
or 


det(a) = S poner(-1, i + j) * ali,j] * det(minor(al i,j})), for any 7). 
i if P a 


Write a C program that reads a, prints a in matrix form, and prints the value of det(a), 
where det is a function that computes the determinant of a matrix. i 


3.2.7. Write a recursive C progre : to sort an array a as follows: 


1. Let k be the index of the middle element of the array. 
2. Sort the elements up to and including afk]. 

3. Sort the elements past afk]. 

4. Merge the two subarrays into a single sorted array. 


This method is called a merge sort. 


3.2.8. Show how to transform the following iterative procedure into a recursive procedure. fli) 
is a function returning a logical value based on the value of i, and g(i) is a function that 
returns a value with the same attributes as i. ig , 


void iter(int n) 


{ 


int i; 


T=; 
while(f(i) == TRUE) { 
/* any group of C statements that */ 
/* does not change the value of i */ 
i = 9(i); 
} /* end while */ 
} /* end iter */ 


3.3 WRITING RECURSIVE PROGRAMS 


In the last section we saw how to transform a recursive definition or algorithm into a 
C program. It is a much more difficult task to develop a recursive C solution to a prob- 
Jem specification whose algorithm is not supplied. It is not only the program but also 
the original definitions and algorithms that must be developed. In general. when faced 
with the task of writing a program to solve a problem there is no reason to look for 
a recursive solution. Most problems can be solved in a straightforward manner using 
nonrecursive methods. However, some problems can be solved logically and most el- 
egantly by recursion. In this section we shall try to identify those problems that can 
be solved recursively. develop a technique for finding recursive solutions, and present 
some examples. 

Let us reexamine the factorial function. Factorial ig probably a prime-example of a 
problem that should not be solved recursively, since the iterative solution is so direct and 
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simple. However, Jet us examine the elements that make the recursive solution work. 
First of all, we can recognize a large number of distinct cases to solve. That is, we want 
to write a program to compute 0!, 1!, 2!, and so on. We can also identify a “trivial” case 
for which a nonrecursive solution is directly obtainable. This is the case of 0!, which 
is defined as 1. The next step is to find a method of solving a “complex” case in terms 
of a “simpler” case. This allows reduction of a complex problem to a simpler problem. . 
The transformation of the complex case to the simpler case should eventually result in ` 
the trivial case. This would mean that the complex case is ultimately defined in terms 
of the trivial case. i ; 

Let us examine what this means when applied to the factorial function. 4! is a” 
more “complex” case than 3!. The transformation that is applied to the number 4 to 
obtain the number 3 is simply the subtraction of 1. Repeatedly subtracting | from 4 
eventually results in 0, which is a “trivial” case. Thus if we are able to define 4! in 
terms of 3!, and in general n! in terms of (n — 1)!, we will be able to compute 4! by 
first working our way down to 0! and then working our way back up to 4! using the 


definition of n! in terms of (n — 1)!. In the case of the factorial. function we have such 
a definition, since 


ni =n* (n- 1)! 
Thus 4! = 4*3!=4*3%*2! =4*3%2* 1! =4*3%2%1*0! =4%2%2% 1% 
| = 24. Hia 

These are the essential ingredients of a recursive routine—being able to define 
a “complex” case in terms of a “simpler” case and having a directly solvable (nonre- 
cursive) “trivial” case. Once this has been done, one can develop a solution using the 


assumption that the simpler case has already been solved. The C version of the factorial 
function assumes that (n — 1)! is defined and uses that quantity in computing m!. 

Let us see how these ideas apply to other examples of the previous sections. In 
defining a * b, the case of b = | is trivial, since in that case, a * b is defined as a. In 
general, a * b may be defined in terms of a * (b — 1) by the definition a * b = a » (b 
— |) + a. Again the complex case is transformed into a simpler case by subtracting 
1, eventually leading to the trivial case of b = 1. Here the recursion is based on the 
second parameter, b, alone. 

In the case of the Fibonacci function, two trivial cases were defined: fib(0) = 0 
and fib(1) = 1. A complex case, fib(n), is then reduced to two simpler cases: fib(n — 1) 
and fib(n — 2). It is because of the definition of fib(n) as fib(n — 1) + fib(n — 2) that two 
trivial cases directly defined are necessary. fib(1) cannot be defined as fib(0) + fib(— 1), 
because the Fibonacci function is not defined for negative numbers. 

The binary search function is an interesting case of recursion. The recursion is 
based on the number of elements. in the array that must be searched. Each time the 
routine is called recursively, the number of elements to be searched is halved (approxi- 
mately). The trivial case is the one in which there are either no elements to be searched 
or the element being searched for is at the middle of the array. If low > high, the first 
of these two conditions holds and —1 is returned. If x=a[mid], the second condition 
holds and mid is returned as the answer. In the more complex case of high — low + | 
elements to be searched, the search is reduced to taking place in one of two subregions. 
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1. The lower half of the array from /ow to mid — } 
2. The upper half of the array from mid + ! to high 


Thus a complex case (a large area to be Searched) is reduced to a simpler case (an 
area to be searched of approximately half the ‘size of the original area). This eventually 


. reduces to a comparison with a single element (a[mid]) or a search within an array of 
no elements. 


The Towers of Hanoi Problem 


“Thus far we have been looking at recursive definitions and examining how they 
fit the pattern we have established. Let us now look at a problem that is not specified in 
terms of recursion and see how we can use recursive techniques to produce a logical and 
elegant solution. The problem is the “Towers of Hanoi” problem whose initial setup is 
shown in Figure 3.3.1. Three pegs, A, B, and C, exist. Five disks of differing diameters 
are placed on peg A so that a larger disk is always below a smaller disk. The object is 
to move the five disks to peg C, using peg B as auxiliary. Only the top disk on any peg 
may be moved to any other peg, and a larger disk may never rest on a smaller one. See 
if you can produce a solution. Indeed, it is not even apparent that a solution exists. 

Let us see if we can develop a solution. Instead of focusing our attention on a 
~~“ solution for five disks, let us consider the general case of n disks. Suppose that we had a 
. Solution for 2 — 1 disks and could state a solution for n disks in terms of the solution for 
n — | disks. Then the problem would be solved. This is true because in the trivial case 
of one disk (continually subtracting 1 from n will eventually produce 1), the solution 
~ is simple: merely move the single disk from peg A to peg C. Therefore we will have 
developed a recursive solution if we can state a solution for n disks in terms of n — 1. 
See if you can find such a relationship. In particular, for the case of five disks, suppose 
that we knew how to move the top four disks from peg A to another peg according to 
the rules. How could we then complete the job of moving all five? Recall that there are 
three pegs available. 

Suppose that we could move four disks from peg A to peg C. Then we could move 
them just as easily to B, using C as auxiliary. This would result in the situation depicted 

- in Figure 3.3.2a. We could then move the largest disk from A to C (Figure 3.3.2b) and 
finally again apply the solution for four disks to move the four disks from B to C , using 


A B (a 


Figure 3.3.1 Initial setup of the Towers of Hanoi. 


142 Recursion Chap. 3 


(a) 


(b) 


> 
& 
a 


(c) 


Figure 3.3.2 Recursive solution to the Towers of Hanoi. 


the now empty peg A as an auxiliary (Figure 3.3.2c). Thus, we may state a recursive 
solution to the Towers of Hanoi problem as follows: 
To move n disks from A to C, using B as auxiliary: 


1. Ifn == 1, move the single-disk from A to C and stop. 

2. Move the top n — 1 disks from A to B, using C as auxiliary. 
3. Move the remaining disk from A to C. 

4. Move the n — | disks from B to C, using A as auxiliary. 
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We are sure that this algorithm will produce a correct solution for any Value of 
n. Ifn == 1}, step J will result in the correct solution. If n == 2, we know that we 
already have a solution forn — | == 1, so-that steps 2 and 4 will perform correctly. 
Similarly, when n == 3, we already have produced a solution for.n = 1 == 2, so that 
steps 2 and 4 can be performed. In this fashion, we can show that the solution-works 
forn == 1,2, 3, 4, 5, ...up to any value for which we desire a solution. Notice that 
we developed the solution by identifying a trivial case (n == 1) and a solution for a 
general complex case (n) in terms of a simpler case (n — 1). 

How can this solution be converted into a C program? We are no longer dealing 
with a mathematical function such as factorial, bux rather with concrete actions such as 
“move a disk.” How are we to represent such actions in the computer? The problem is 
not completely specified, What are the inputs to-the program? What are its outputs to 
be? Whenever you are told to write a program, you must receive specific instructions 
about exactly what the program is expected to do. A problem Statement such as “Solve 
the Towers of Hanoi problem” is quite insufficient. What is usually meant when such 
a problem is specified is that not only the program but also the inputs and outputs must 
be designed, so that they reasonably correspond to the problem description. 

The design of inputs and outputs is an important phase of a solution and should 
be given as much attention as the rest of a program. There are_two-reasons for this. 
The first is that the user (who must ultimately evaluate and pass judgment on your 
work) will not see the elegant method that you incorporated in your program but will 
struggle mightily to decipher the output or to adapt the input data to your particular input 
conventions. The failure to agree early on input and output details has been the cause of 
much grief to programmers and users alike. The second reason is that a slight change 
in the input or output format may make the program much simpler to design. Thus, 
the programmer can make the job. much easier if he or she is able to design an input 
or output format compatible with the algorithm. Of course these two considerations, 
convenience to the user and convenience to the programmer, often conflict sharply, and 
some happy medium must be found. However, the user as well as the programmer must 
be a full participant in the decisions on input and output formats. : 

Let us, then, proceed to design the inputs and outputs for this program. The only 
input needed is the value of n, the number of disks. At least that may be the program- 
mer’s view. The user may want the names of the disks (such as “red,” “blue,” “green,” 
and.so forth) and perhaps the names of the pegs (such as “left,” “right,” and “middle”) 
as well. The programmer can probably convince the user that naming the disks 1, 2, 3, 
-++ , Mand the pegs A, B, Cis just as convenient. If the user is adamant. the programmer 
can write a small function to convert the user's names to his or her own and vice versa. 

4 reasonable form for the output would be a list of statements such as 


move disk nnn from peg yyy to peg zzz 


where nnn is the number of the disk to be moved, and yyy and zzz are the names of the 
pegs involyed. The action to be taken for a solution would be to perform each of the 
output statements in the order that they appear in the output. 

The programmer then decides to write a subroutine rowers (being purposely vague 


‘about the parameters at this point) to print the aforementioned output. The main program 
vould be 
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void main() . . PN ‘ead „$? 
int n; ! > bisg hi : éi 


scanf("%d", &n);.. r , Wage 3% bei 
towers (parameters); . 2 
}/* end main */ ‘e ' 


‘Let us assume that the user will be satisfied to name the disks Ips, on 
and the pegs A, B, and C. What should the parameters to towers be? Clearly, they 


reason for labeling the smallest disk 4; logically the largest disk could have been la- 
beled l and the smallest disk n. However, since it leads to a simpler and more di- 
rect program, we choose to label the disks so that the smallest disk has the smallest 
number. 

What are the other Parameters to towers? At first glance, it: might appear that 
no additional parameters are necessary, since the pegs are named A, B, and C by 
default. However, a closer look at the ‘recursive solution leads us to the realization 
that on the recursive calls disks will not be moved’ from A ‘to C using B as auxil- 
iary but rather from A-to B using C (step 2) or from B to C using A (step 4), We 
therefore include three more parameters in towers, The first, Frompeg, represents 
the peg from which we are removing disks; the ‘second, topeg, represents the peg 
to which we will take the disks; and the third, auxpeg, represents, the auxiliary peg. . 
This situation is one which is quite typical of recursive routines; additional parame- 
ters are necessary to handle the recursive call situation. We already saw one exam- 
ple of this in the binary search program where the.parameters Jow and high*were 
necessary. - A pay of t ps me 

The complete program to solve the Towers of Hanoi problem, closely following 
the recursive solution, may be written as follows; o> 4 gs a>. o; ` 


#include <stdio.h> 
void towers(int, char, char, char); > .. 
void main() 

int n; 

scanf("Xd", ên); 

towers(n, 'A', 'C', 'B'); 


}/* end main */ 
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void towers(int n, char frompeg, char vopeg, char auxpeg) 
{ 
/* If only one disk, make the move and return, */ 


if (n == 1) { 
printf("\nXs%cX%sXc", “move disk 1 from peg ", frompeg, " to peg ", topeg); - 
return; 

} /* end if */ 

/* Move top n-1 disks from A to B, using C as */ 

pe auxiliary */ 

towers(n-1, frompeg, auxpeg, topeg); 

i move remaining disk from A to C +/ 


printf ("\n%s%d%s%cXs%c", “move disk ", n, " from peg ” 


/* Move n-1 disk from B to C using A as t7 
/* ; auxiliary */ 
towers(n-1, auxpeg, topeg, frompeg) ; 

}/* end towers */ ` 


s Trace the actions of the foregoing program when it reads the value 4 for n. Be 
‘ careful to keep track of the changing values of the parameters frompeg, auxpeg, and 
sopeg. Verify that it produces the following output: = 


move disk 1 from peg A to peg B 


move disk 2 from peg A to peg C 
move disk 1 from peg B to peg C 
move disk 3 from peg A to peg B 
„mave disk 1 from peg C to peg A 
J move disk-2 from peg C to peg B 
move disk 1 from peg A to peg B 
move disk 4 from peg A to peg C 
move disk 1 from peg B to peg C 
move disk 2 from peg B to peg A 
move disk 1 from peg C to peg A 
move disk-3 from peg B to peg C 
move disk 1 from peg A to peg B 
move disk 2 from peg A to peg € 
move disk 1 from peg B to peg C 


Verify that the foregoing solution actually works and does not violate any of the rules. 
Translation from ‘Prefix to Postfix Using Recursion 


Let us examine another problem for which the recursive soiution is the most 
direct and elegant one. This is the problem of converting a prefix expression to post- 
fix. Prefix and postfix notation were discussed in the last chapter. Briefly, prefix and 
postfix notation are methods 'of writing mathematical expressions without parentheses. 
In prefix notation each operator immediately precedes its operands. In postfix notation 
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each operator immediately follows its operands. To refresh your memory, here are 
a few conventional (infix) mathematical expressions with their prefix and postfix 
equivalents: 


infix prefix Postfix 


Se 
A+B +AB AB+ 

A+ReC +A*BC ABC * + 
A*(B+C) *A + BC ABC +» 

A*B+C + * ABC ABeC+ 
A+BeC+D—EeF —++A*BCDeEF ABC *+D+ EF«- 


(A + B)*(C+D-E)*F **+AB-~+CDEF ` AB+CD+E-#Fs 
se 


The most convenient way to define postfix and prefix is by using recursion. As- 
suming no constants and using only single letters as variables, a prefix expression is 
a single letter, or an operator followed by two prefix expressions. A postfix expression 
may be similarly defined as a single letter, or cs an operator preceded by two postfix 
expressions. The above definitions assume that all operations are binary—that is, that 
each requires two operands. Examples of such operations are addition, subtraction, mul- 


our problem. Given a prefix expression, how can we convert it into a postfix expression? 
We can immediately identify a trivial case: if a prefix expression. consists of only a 
single variable, that expression is its own postfix equivalent. That is, an expression 
such as A is valid as both a prefix and a postfix expression. 

Now consider a longer prefix String. If we knew how to convert any shorter prefix 
String to postfix, could we convert this longer prefix string? The answer is yes, with 


postfix and appending it to the end of the first converted operand, and finally appending 
the initial operator to the end of the resultant String. Thus we have developed a recursive 
algorithm for converting a prefix string to postfix, with the single provision that we must 
specify a method for identifying the operands in a prefix expression: We can summarize 
our algorithm as follows: t 


1. If the prefix string is a single variable, it is its own postfix equivalent. 
2. Let op be the first operator of the prefix string. 
3. Find the first operand, opnd1, of the string. Convert it to postfix and call it post1. 
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4. Find the second operand, opna2, of the string. Convert it to postfix and call it 
post2, 


5. Concatenate post), post2, and op. 


One operation that will be required in this program is that of concatenation. For 
example, if two strings represented by a and b represent the stings “abcde” and “xyz” 
respectively, the function call 


strcat(a, b) 


places into a the string “abcdexyz” (that is, the string consisting of all the elements of a 
followed by all the elements of b). We also require the functions strlen and substr. The 
function strlen(str) returns the length of the string str. The substr(s1,ij,s2) function 
sets the string s2 to the substring of s1, starting at position į containing j characters. 
For example, after executing substr(“abcd”,1,2,s), s equals “bc”. The functions strcat, 
strlen, and substr are usually standard C string library functions. 

Before transforming the conversion algorithm into a C program, let us examine 
its inputs and outputs. We wish to write a procedure convert that accepts a character 
string. This.string represents a prefix expression in which all variables are single letters 
and the allowable operators are ‘+’, ‘—", ‘*’, and ‘7’. The procedure produces a string 
that is the postfix equivalent of the prefix parameter. #. 

Assume tthe existence of-a function find that accep.s a string and returns an integer 
that is the length of the longest prefix expression contained within the input string that 
Starts at the beginning of that string: For example, find (“A + CD”) returns 1, since “A” 
is the longest prefix string starting at the beginning of “A + CD”. find(“+ * ABCD + 
GH”) returns 5, since “+ * ABC” is the longest prefix string starting at' the beginning 
of “+ * ABCD + GH”. If no such prefix string exists within the input string starting at 
the beginning of the input string, find returns 0. (For éxample, find (“* + AB”) returns 
0.) This function is used to identify the first and second operands of a prefix operator. 
convert also calls the library. function*isalpha, which determines if its parameter is a 


letter. Assuming the existence of the function find, a conversion routine may be written 
as follows. 


void convert (char ep aes char poste) 

{ 
char phd [MAXLENGTHI,. ‘opnd2[MAXLENGTH} ;” i , 
char. post1[MAXLENGTH]:, post2(MAXLENGTH) ; W MEE tli 
char KempIMAXLENGTH 4, 
char op[1]; 
int length; 
int i, jm, n; 


if (length = strlen(prefix)) == 1) { 
if (isalpha(prefix[0])) { 
/* The prefix string is a single letter. */ 
postfix(0] = prefix[0]; 


14g Recursion Chap. 3 


Angee Ah ilour Ti garut Twite ns te wear: gf) sw 


oa ný i «so postfix[i]= N0" hn iaoi Sl W paT gee XË coping Pae 
i oy ooo return; r graè HIDIS BE W rho Ye % Gaerne ya! i 04} be 
} /*end.if.*/ Hig a Ws yn: Teda gni 1D wijo ganou? pA sbin 
ea printf("\niT legal prefix, ARTANO iiu moony east toiag aditi tase Seale 
= exit); adi ani tyoeltern tit nager ett won SW Sa S ft i 
} /* end if */ OS 
/* The prefix string is longer than a Single */ {Date apio tbat? p 
/* character. Extract the operator and the */ 
ys two operand lengths. i HAARAA On 
op[0] = prefix[0]; Prone: Pa? 


op[1] = '\0'; 8 te 
substr(prefix, 1, length-1, temp); ; 
m = find(temp); 


(0 a (issaiesi rr « fps’ > Di 
substr(prefix, m + 1, length-m-1, temp); Ay tki 
n = find(temp); R ee, 


Il (wv == 0 ell (n= 0) duke Gm tin Lots length)) { 
printf("\nillegal prefix string"); yarscader . 
exit(1); “ (1) pms: 

} /* end if */ 


SE gopio] 1e "4! BE 2080) Paik J= 18! gdopl0} te 7") 


* hassaga seat sir BAY? eoreienze 


substr(prefix, 1, m, opnd1); Å jelas gz) ti 
substr(prefix, ml, n, opni2); N ayer 
convert(opnd1, postl); (ges: er a lady 
convert(opnd2, post2); ages? è 
strcat(postl, post2); ET EA os @) TF 


strcat(postl, op); $ +. g pusisg ePterg Bay oF 
Substr(post1, 0, length, postfix); barisan paagi oA 
>= Hend convert t —_— E a C 
p p mY Fi-s-trpnal Jee ; yet 
Note that several checks have been incorporated into convert'to ensure that the 
parameter is a valid prefix string. One of the most difficult classes of errors to detect are 
those resulting from invalid inputs and the programmer's neglect to check for validity. 
We now turn our attention to the function find, which accepts a character string and 
a starting position and returns the length of the longest prefix String that is contained 
in that input string starting at that Position. The word “longest” in this definition is 
superfluous, since there is at most one substring starting at a given position of.a given 
string that is a valid prefix expression. © hy BY ok is 
We first show that there is at most’ one valid prefix expression starting at the 
beginning of a string. To see this,’note that it ig trivially true in a string of length 1. As- 
sume that it is true for a short string. Then a long string that contains a prefix expression 
as an initial substring must begin with either a variable, in which case that variable is 
the desired substring. or with an operator. Deleting the initial operator, the remaining 
string is shorter than the original string and can therefore have at most a Single initial 
prefix expression, This expression is the first operand of the initial operator. Similarly, 
the remaining substring (after deleting the first operand) can only have a single initial 
substring that is a prefix expression. This expression must be the second operand. Thus 
we have uniquely identified the operator and operands of the prefix expression Starting 
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at the first character of an arbitrary string, if such an expression exists. Since there is 
at must one valid prefix string starting at the beginning of any string, there is at most 
one such string starting at.any position of an arbitrary string. This is obvious when we 
consider the substring of the given string starting at the given position. 

Notice that this proof has given us a recursive method for finding a prefix expres- 
sion in a string. We now incorporate this method into the function find: 


iat find(char str{}) 
{ 


char temp[MAXLENCTH) ; 
int length; 
iat i, j, m, n; 


if (Clength = strlen(str)) == 0) 
retera (0); 
if (isalpha(str(0}) != 0) 
/* -First character is a letter. */ 
/* That letter is the initial */ 
1. substring. *y 
retera (1); 
/* otherwise find the first operand */ 
if (strlen(str) < 2) 
return (0); 
substr(str, 1, length-1, temp); 
‘m= find(temp); 
if (a == 0 || strlen(str) ==-m) 
/* no valid prefix operand or */ 
/*. no second operand */ 
return (0); 
substr(str, mel, length-m-1, temp); 
n = find(temp); 
if (n == 0) 
` petura (0); 
return (m+n+1); 
} /* end find */ 


Make sure that you understand how these routines work by tracing their actions 
on both valid and invalid prefix expressions. More important, make sure that you un- 
derstand how they were developed and how logical analysis led to a natural recursive 
solution that was directly translatable into a C program., 


EXERCISES 


3.3.1. Suppose that another provision were added to the Towers of Hanoi problem: that one 
disk may not rest on another disk that is more than one size larger (for example, disk 
| may only rest on disk 2 or on the ground, disk 2 may only rest on disk 3 or on the 
ground, and so on). Why does the solution in the text fail to work? What is faulty about 
the logic that led to it under the new rules? 
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3.3.2. Provethat the number of moves performed by towers in moving n disks equals 2" — 1. 
Can you find a method of solving the Towers of Hanoi problem in fewer moves? Either 
find such a method for some n or prove that none exists. 


3.3.3. | Define a postfix and prefix expression to include the possibility of unary operators. 
Write a program to convert a prefix expression possibly containing the unary negation 
operator (represented by the symbol ‘@") to postfix. 

3.3.4. Rewrite the function find in the text so that it is nonrecursive and computes the length 
of a prefix string by counting the number of operators and single-lctter operands. 

3.3.5. Write a recursive function that accepts a prefix expression consisting of binary opera- 
tors and single-digit integer operands and returns the valuc of the expression. 

3.3.6. Consider the following procedure for converting a prefix expression to postfix. The 
routine would be called by conv(prefix postfix). 


_ void conv(char prefix[], char postfix{]) 
{ 
char first(2)}; 
Char tl{MAXLENCTH], t2{MAXLENCTH]; 


first(0) = prefix(0); 
first{1) = '\0'; 
subdstr(prefix, 1, strien(prefix) - 1, prefix); 
if (first(O) == '+' || first[u) == '*' || first(0) == '-' || 
J first(0) == '/') { 
conv(prefix, t1); 
conv(prefix, t2); 
strcat(ti, t2); 
strcat(tl, first); 
a substr(tl, 0, strien(tl), postfix); 
return; 
} /* end if */ 
postfix[0] = first[0]; 
postfix[1] = '\0'; 
} /* end conv */ 


Explain how the procedure works. Is it better or worse than the method of the text? 
What happens if the routine is called with an invalid prefix string as input? Can you 
incorporate a check for such an invalid string within convert? Can you design such a 
check for the calling program after convert has returned? What is the value of n after 
convert returns? 

3.3.7. Develop a recursive method (and program it) to compute the number of different ways 
in which an integer k can be written as a sum, each of whose operands is less than n. 

3.3.8. Consider an array a containing positive and negative integers. Define contigsum(i,j) 
as the sum of the contiguous elements afi) through alj) for all array indexes i<=/. 
Develop a recursive procedure that determines i and j such that conrigsum(i.j) is max- 
imized. The recursion should consider the two halves of the array a. 

3.3.9. Write a recursive C program to find the Ath smallest element of an array u of numbers 


by choosing any element a[i] of u and partitioning « into those elements smaller than, 
equal to, and greater than ali]. 
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3.3.10. The eight-queens problem is to place eight queens on a chessboard so, that no queen is 
~ attacking any other queen. The following is a recursive Program to solve the problem. 

board is an eight by eight array’ that represents a chessboard, board{il[ j} == TRUE 

if there is a queen at position [i]{ j], and FALSE otherwise. good() is a function that 

returns TRUE if no two queens on the chessboard are attacking each other and FALSE 
otherwise. At the end of the program, the routine drawboard() displays a solution tò 

the problem. mpeg: a Dg Ea rm 


#define TRUE 1 > on a e's 

. #define FALSE 0 3 

int try(int)- i À sih 
void drawboard(void); -+ i = 


Static short int board>{[8] [8]; ‘oa Sw Yea? Bro 
void main() ~S am 
int i, j; 


for(i=0; i<8; i++) 
For(j=0; j<8; j++) 
board[i] [j] = FALSE; 
if (try(0) == TRUE) 
drawboard(); 
}/* end main */ 


int try(int n) 
int i; 


for(i=0; i<8; i++) { 
board[n] [i] = TRUE; 
if (n == 7-&& good() == TRUE) 
return(TRUE); 
if (n < 7 & good() == TRUE & try(n+1) ==. TRUE) 
return(TRUE) ; 
board[n] [i] = FALSE; 
} /* end for */ 
return(FALSE); 
} /* end try */ 


The recursive function try returns TRUE if it is possible, given the board at the time 
that it is called, to add queens in rows n through 7 to achieve a solution. try returns 
FALSE if there is no solution that has queens at the positions in board that already 
contain TRUE. If TRUE is returned, the function also adds queens in rows n through 7 
to produce a solution. Write the foregoing functions good and drawboard, and verify 
that the program produces a solution. (The idea behind the solution is as follows: bourd 
represents the global situation during an attempt to find a solution. The next step toward 
finding a solution is chosen arbitrarily (place a queen in the next untried position in row 
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n). Then recursively test whether it is pessible to produce a solution that includes that 
Step. If it is, return. If it is not, backtrack from the attempted ne t step—bogrd|n]|i] = ` 
« anii FALSE—and try another possibility: (This method is called backtracking) r in 
3.3.11. A J0 X 10 array maze of Os and’ Is represents a maze in which a traveler must find 
a path from maze{0j[0] to maze[9}[9]. The traveler may move from a Square into any 
adjacent square in the’ same row or column, but may noi skip over any squares or move 
diagonally. In addition, the traveler may not move into any square that contains al. 
maze[0)[0] and maze[9}[9} contain Os: Write d routine which accepts such a maze and 
either prints a message that no path through the maze exists or which prints a list of 
positions representing a path fiom [0][0] to [9][9]. TAi tae 


TEN. yia y bi Aa gaii E TO} SSS < : ab pr a 
3.4 SIMULATING RECURSION (aninakt ai N 
Bini f MERES SiL b VIN AD BEA N 
In this section we examine more closel y some of the mechanisms.used to implement re- 
cursion so that we can simulate these mechanisms using nonrecursive techniques. This 
activity is important for several reasons. First of all, many commonly used program- 
ming languages (such as FORTRAN, COBOL, and many machine languages) do not 
allow recursive programs. Problems such as the Towersjof. Hanoi and prefix-to-postfix 
conversion, whose solutions can be derived and stated quite simply using recursive 
techniques, can be programmed in these langiages by simulating the recursive solu- 
tion using more elementary operations. If we know that the recursive solution is.correct 
(and it is ofter fairly easy to prove such a solution correci) and we have established 
techniques for converting’a recursive Solution to a nonrecursive one, we can create a 
correct solution in a nonrecursive language: It is not uncommon for a programmer to be 
able to state a recursive solution to å problem. The ‘ability to generate a nonrecursive 
solution from a recursive algorithm is indispensable when using a compiler that does 
not support recursion. OP AAS AI Eee ee f 
c a Another reason for examining the implementation of recursion is that it will allow- 
us to understand the implications of recursion and some of its hidden pitfalls. Although 
these pitfalls do not exist in mathematical definitions that employ recursion, they seem 
to be an inevitable accompaniment of an implementation in a real language on a real 
machine. 

Finally, even in a language such as C that does support recursion, a recursive 
solution to a problem ix often more expensive than a nonrecursive solution, both in 
terms of time and space. Frequently, this expense is a small Price to pay for the logical 
simplicity and self-documentation of the recursive solution. However, in a production 
program (such as a compiler, for example) that may be run thousands of times, the 
recurrent expense is a heavy burden on the system’s limited resources. Thus, a program 
may be designed to incorporate a recursive solution in order to reduce the expense of 
design and certification, and then carefully converted to a nonrecursive version to be 
put into actual day-to-day use. As we shall see. in performing such a conversion it is 
often possible to identify parts of the implementation of recursion that are superfluous 
in a particular application and thereby significantly reduce the amount of work that the 
program must perform. i 

Before examining the actions of a recursive routine, let us take a step back and 
examine the action of a nonrecursive routine. We will then be able to see what mech- 
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anisms must be added to support recursion. Before proceeding we adopt the following 
convention. Suppose that we have the statement 


rout(x) ; 
where rout is defined as a function by the header 


rout(a) 


x is referred to as an argument (of the calling function), and a is referred to as a pa- 
rameter (of the called function). 


What happens when a function is called? The action of calling a function may be 
divided into three parts: 


1. Passing arguments 
2. Allocating and initializing local variables 
3. Transferring control to the function 


Let us examine each of these three steps in turn. 

1. Passing arguments. For a parameter in C, a copy of the argument is made 
locally within the function, and any changes to the parameter are made to that local 
copy. The effect of this scheme is that the original input argument cannot be altered. In 
this method, storage for the argument is allocated within the data area of the function. 


2. Allocating and initializing local variables. After arguments have been 
passed. the local variables of the function are allocated. These local variables include all 
those declared directly in the function and any temporaries that must be created during 
the course of execution. For example, in evaluating the expression 


X+y+zZ 


a storage location must be set aside to hold the value of x + y so that z can be added to 

it. Another storage location must be set aside to hold the value of the entire expression 
„ after it has been evaluated. Such locations are called temporaries, since they are needed 

only temporarily during the course of execution. Similarly, in a statement such as 


„X= fact(n) 


Ia 


a temporary must be set aside to hold the value of fact(n) before that value can be 
assigned to x. 


3. Transferring control to the function. At this point control may still not be 
passed to the function because provision has not yet been made for saving the return 
address. \f a function is given control, it must eventually restore control to the calling 
routine by means of a branch. However, it cannot execute that branch unless it KNOWS, 
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the location to which it must return. Since this location is within the calling routine 
and not within the function, the only way that the function can know this address is to 
have it passed as an argument. This is exactly what happens. Aside from the explicit 
- arguments specified by the programmer, there is also a set of implicit arguments that 
~ contain information necessary for the function to execute and retum correctly. Chief 
among these implicit arguments is the return address. The function stores this address 
within its own data area. When it is ready to return control to the calling program, the 
function retrieves the return address and branches to that location. 
Once the arguments and the return address have been passed, control may be 
transferred to the function, since everything required has been done to ensure that the 
function can operate on the appropriate data and then return to the calling routine safely. 


Return from a Function 


When a function returns, three actions are performed. First, the return address is 
retrieved and stored in a safe location. Second, the function's data area is freed. This 
data area contains all local variables (including local copies of arguments), temporaries, 
and the return address. Finally, a branch is taken to the return address, which had been 
previously saved. This restores control to the calling routine at the point immediately 
following the instruction that initiated the call. In addition, if the function returns a 
value, that value is placed in a secure location from which the calling program may 
retrieve it. Usually this location is a hardware register that is set aside for this purpose. 

Suppose that a main procedure has called a function b that has called c that 
has, in turn, called d. This is illustrated in Figure 3.4, 1a, where we indicate that control 
Currently resides somewhere within d. Within each function, there is a location set aside 


(b) 


Figure 3.4.1 Series of procedures calling one another. 
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for the return address. Thus the return address area of d contains the address:of the _ 
instruction in c immediately following the call to d. Figure 3.4.1bshows the situation 
immediately following d's return to c. The return address. within:d has been retrieved 4 
and control has been transferred SOON, COORG 4. a eniai aanita Ni, © 
You may have noticed that the string of return addresses forms'a stack; that is; the 


most’recent return address to be added, to the chain.is the:first.to be removed. :At any 
point, we can only access the return ad ess from within the,function that is currently 
executing, which represents the top of the stack. When the stack is popped (Wat is, when 
‘the function returns), a new top is revealed within the.calling routine. Calling a function 


‘has the effect of pushing an element onto the stack, and returning pops the stack. 
Implementing Recursive Functions naia N rere . 


What must be added .to this description in the-case of a recursive funciién? The 
answer is, surprisingly little, Fae time a recursive function-calls itself. an entirelynew 
data area for that particular call must be allocated. As before, this'data area’contains all 
parameters, local variables, temporaries, and.a return address. The point to remember 
is that in recursion a data area is associated not with a function. alone but with ‘a päi- 
ticular call to'that function. Each call causes anew data area to be allocated, and each 
‘reference to an item in the function's data area is.to the data area'of the most retent-call. 
Similarly, each return causes the. current data area to be freed, and the data’ area allo- 
cated immediately prior to the current area becomes current. This behavior, ofcourse, 
suggests the use ofa stack, > ii 0) an 

In Section 3.1.2, where we described the action of the-recursive factorial function, 
we used a set of stacks to represent the successive allocations of each of the local vari- 
ables and parameters. These stacks may be thought of as separate stacks. one for each 
local variable. Alternatively, avd closer to reality, we may think of all of these stacks as 
a single large stack. Each element of this ‘large stack is an entire.data area containing 
subparts representing the individual local variables or parameters. 

Each time that the recursive routine is called, a new data area is allocated. The pa- 
rameters within this data‘area are initialized to refer to the values of their corresponding 
arguments. The return address within the data area is initialized to the address follow- 
ing the call instruction. Any reference to local variables or parameters is via the current 
data area. 

When the recursive routine returns. the returned value (if any) and the return 
address are saved, the data area is freed, and a branch to the return address is executed. 
The calling function retrieves the returned value (if any), resumes execution, and refers 
to its own data area that is now on top of the stack. 3 ; 

Let us now examine how we can simulate the actions of a recursive function. We 
will need a stack of data areas defined by 


#define MAXSTACK 50; 
struct stack { 

int top; 

Struct dataarea item[MAXSTACK]; 
F 
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The dataarea is itself a structure containing the various items that exist in a data 
area and must be defined to contain the fields required for the particular function’ being 
simulated. a 


t 


% aitoi 


` Simulation of Factorial 


Let us look at a specific example: the factorial function. We present the code for 


that function. including temporary variables explicitly and omitting the test for negative 
input, as follows: : 


int fact(int n) 


{ 


int x, y; 


if (n == 0) 

' return(1); . 
xis nily ? ty . 
y = fact(x)}- f i ? 
return(n * y); : mA iS 


} /* end fact */ 


How are we to detine the data area for this function? It must contain the parameter 
n and the local variables x and v. As we shall see. no temporaries are needed. The data 
area must also contain a return address. In this case, there are two possible points to 
which we might want to return: the assignment of fact(x) to v. and the main program 


that called fact. Suppose that we had two labels and that we let the lubel /abel2 be the 
_ label of a section of code, 


label2; y = result; ‘ii tus ba we: 


>t 


(s 


within the simulating program. Let the label label be the label of a statement 


jabell: return(result); 


X 


This reflects a convention that the variable result contains the value to be returned by an 
invocation of the facr function. The return address will be stored as an integer į (equal 
to either I or 2). To effect a return from a recursive call the ‘statement 


switch(i) { 


case 1: goto labell; 
case 2: goto label2; 


} /* end case */ 


is executed. AS: ifi == 1.a return is executed to the main prograr i that called fact, 


and if i 


== 2, a return is simulated to the assignment of the returned value to the 


variable v in the previous execution of fact. 
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The data area stack for this example can be defined as follows: 


#define MAXSTACK 50 | - 
struct dataarea { 

int param; 

int x; 

long int y; 

short int retaddr; 


struct stack { 
int top; 
struct dataarea item[MAXSTACK]; 


The field in the data area that contains the simulated parameter is called param, rather 
than n, to avoid confusion with the parameter n passed to the simulating function. We 
also declare a current data area to hold the values of the variables in the simulated 
“current” call on the recursive function. The declaration is: 


Struct dataarea currarea; 
In addition, we declare a single variable result by 
Jong int result; 


This variable is used to communicate the returned value of fact from one recursive call 
of fact to its caller, and from fact to the outside calling function. Since the elements on 
the stack of data areas are structures and, as we mentioned earlier, it is more efficient 
to pass structures by reference, we do not use the function pop to pop a data area from 
stack. Instead, we write a function popsub defined by 


void popsub(struct stack *ps, Struct dataarea *parea) 


leave the details as an exercise. 
A return from fact is simulated by the code 


The call popsub(&s, &area) pops the stacks and sets area to the popped element. We 


result = value to be returned; 
i = currarea.retaddr; 
popsub(&s, &currarea); 
switch(i) { 

case 1: goto labell; 

case 2: goto label2; 
} /* end switch */ 
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A recursive call on fact is simulated by pushing the current data area on the stack, 
reinitializing the variables currarea.param and currarea.retaddr to the parameter and 
return address of this call, respectively, and then transferring control to the start of the 

_ Simulated routine. Recall that currarea.x holds the value of n — 1 that is to be the new 


parameter. Recall] also that on a recursive call we wish to eventually return to label 2. 
The code to accomplish this is 


push(&s, &currarea); 

currarea.param = Currarea.x; 

currarea.retaddr = 2; 

goto start; /* start is the label of the */ 
/* start of the simulated routine. */ 


Of course, the popsub and push routines must be written so that they pop and push 
entire structures of type dataarea rather than simple variables. Another imposition of 
the array implementation of stacks is that the variable currarea.y must be initialized to 
some value or an error will result in the push routine upon assignment of currarea.y to 
the corresponding field of the top data area when the program starts. 

When the simulation first begins the current area must be initialized so that cur- 
rarea.param equals n and currarea.retaddr equals | (indicating a return to the calling 
routine). A dummy data area must be pushed onto the stack so that when popsub is 
executed in returning to the main routine, an underflow does not occur. This dummy 
data area must also be initialized so as not to cause an error in the push routine (see the 


last sentence of the preceding paragraph). Thus, the simulated version of the recursive 
fact routine is as follows: uy s P 


struct dataarea { 
int param; 
int x; 
long int y; 
short int retaddr; 
}s 
struct stack { 
int top; 
struct dataarea item[MAXSTACK]; 
}s 


int simfact(int n) 
{ 
Struct dataarea currarea; 
struct stack s; 
short int i; 
long int result; ~ 
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start: 


Jabel2: 


label1: 


S.top = -l; .. 


/* initialize a dummy data area 
currarea.param = 0; 3 
currarea.x = 0; 

Currarea,y . =0; 


currarea.retaddr = 0; 


Yi push the dummy data area onto the stack 
push (&s, &currarea); 


/* set the parameter and the return address of 


*/ 


g4 
Y. 


/* the current data area to their proper values, */ 


currarea.param =n; 
currarea.retaddr = 1; 
/* this is the beginning of the simulated 


/* factorial routine. 
if (currarea.param == 0) { 
/* simulation of return(1); 


result = 1; 
i = currarea.retaddr; 
popsub(&s, &currarea); 
switch(i) { 
case 1: goto labell; 
case 2: goto label2; 
} +" end switch */ 
} /* end if */ 
currarea.x = Currarea.param - 1; 
/* simulation of recursive call to fact 
push(&s, &currarea); 
currarea.param = currarea.x; 


` currarea.retaddr = 2; 


goto start; 

/* This is the point to which we return 
/* from the recursive call. Set currarea.y 
rád to the returned value. 
currarea.y = result; 

/* simulation of return(n * y) 


result = currarea.param.* currarea.y; 
i = currarea.retaddr; 
popsub(&s, &currarea); 
switch(i) { 
case 1: goto labell; 
case 2: goto label2; 
} /* end switch */ 


/* At this point we return to the main routine. 


return(result); 


} /* end simfact */ 


derstand what the program does and how it does it. 
Notice that no space was reserved in the data area for temporaries, since they 
need not be saved for later use. The temporary location that holds the value of n * y 
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a 
*/ 


*/ 


pa 


Recursion 


Trace through the execution of this program for n = 5 and be sure that you un- 
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in the original recursive routine is simulated by the temporary for currarea.param * 
currarea.y in the simulating routine. This is not the case in general. For example, if a 
recursive function funct contained a statement such as 


X=a* funct(b) + c* funct(d); 


the temporary for a * funct(b) must be saved during the recursive call on funct(d). How- 
ever, in the example of the factorial function, it is not required to stack the temporary. 


Improvigg the Simulated Routine 


The foregoing discussion leads naturally to the question of whether all the local 
variables really need to be stacked at all. A variable must be saved on the stack only if 
its value at the point of initiation of a recursive call must be reused after return from that 
call. Let us examine whether the variables n, x, and y meet this requirement. Clearly n 
does have to be stacked. In the statement 


y =n * fact(x); 


the old value of n must be used in the multiplication after return from the recursive call 
on fact. However, this is not the case for x and y. In fact, the value of y is not even 
defined at the point of the recursive call, so clearly it need not be stacked. Similarly, 
although x is defined at the point of call, it is never used again after returning, so why 
bother saving it? 

_ This point can be illustrated even more sharply by the following realization. If x 
and y were not declared within the recursive function fact, but rather were declared as 
global variables, the routine would work just as well. Thus, the automatic stacking and 
unstacking action performed by recursion for the local variables x and y is unnecessary. 

Another interesting question to consider is whether the return address is really 
needed on the stack. Since there is only one textual recursive call to fact,-there is only 
one return address within fact. The other return address is to the main routine that orig- 
inally called fact. But suppose a dummy data area had not been stacked upon initial- 
ization of the simulation. Then a data area is placed on the stack only in simulating a 
recursive call, When the stack is popped in returning from a recursive call, that area is 
removed from the stack. However, when an attempt is made to pop the stack in sim- 
ulating a return to the main procedure, an underflow will occur. We can test for this 
underflow by using popandtest rather than popsub, and when it does occur we can re- 
turn directly to the outside calling routine rather than through a local label. This means 
that one of the return addresses can be eliminated. Since this leaves only a single pos- 
sible return address, it need not be placed on the stack. 

Thus the data area has been reduced to contain-the parameter alone, and the stack 
may be declared by l 


#define MAXSTACK 50 
struct stack { 

int top; 

int param[MAXSTACK] ; 
}; 
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The current data area is reduced to a single variable declared by 


int ‘currparam; 


The program is now quite compact and comprehensible. 
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int simfact(int n) 


start: 


label2: 


label1: 


struct stack, s; 
short int und; 
long int result, y; 
int currparam, x; 


s.top = -l; 
currparam = n; ; 
/* This is the beginning cf the simulated */ 


y factorial routine. #7 
if (currparam == 0) { ' 
/* simulation of return(1) 2y 
result = 1; 


popandtest(&s, &currparam, &und); 
switch(und) { 
case FALSE: goto label2; 
case TRUE: goto labell; 
} /* end switch */ 
} /* end if */ 
/* currparam != 0 */ 
x = Currparam - 1; 
/* simulation of recursive call to fact */ 
push(&s, currparam); 
currparam = x; 


goto start; 

/* This is the point to which we return ay 
/* from the recursive call. Set *y 
/* y to the returned value. */ 
y = result; 

/* simulation of return (n * y); af 


result = currparam * y; 
popandtest(&s, &currparam, und); 
switch(und) { 

case TRUE: goto labell; 

case FALSE: goto label2; © 
} /* end switch */ 
/* At this point we return to the main “/ 
/* routine. ty 
return(result); - 


} /* end simfact */ 
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Eliminating gotos _ 


Although the preceding program is certainly simpler than the previous one, it is 
still far from ideal. If you were to look at the program without having seen its derivation, 
it is probably doubtful that you could identify it as computing the factorial function. The 
statements : 


goto start; 
and 
goto label2; 


are particularly irritating, since they interrupt the faw of thought at a time that one 
might otherwise come to an understanding of what is aappening. Let us see if we can 
transform this program into a still more readable version. 

Several transformations are immediately apparent. First of all, the statements 


popandtest(&s, &currparam, &und); 
switch(und) { $ 

case FALSE: goto label2; 

case TRUE: goto labell; 
} /* end switch */ 


are repeated twice for the two cases currparam == 0 and currparam != 0. The two 
sections can easily be combined into one. j : 

A further observation is that the two variables x and currparam are assigned val- 
ues from each other and are never in use simultaneously; therefore they may be com- 
bined and referred to as one variable x. The same is true of the variables result and y, 
which may be combined and referred to as the single variable y. 

Performing these transformations leads to the following version of si#fact: 


struct stack { 

int top; 

int param[MAXSTACK] ; 
}; 


int simfact(int n) 

{ 
struct stack s; / 
short int und; 
int x; 
long int y; 
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z 
s.top = -1; 
xen; \ 
start: /* This is the beginning of the simulated */ 
is factorial routine. */ 
if (x == 0) 
y=1; i 
else { 
push(&s, x--); 
goto start; 
} /* end else */ 
labell: popandtest(&s, &x, éund); 
if (und == TRUE) 
return(y); 
label2: y *= x; 
goto labell; 
} /* end simfact */ 


We are now beginning to approach a readable program. Note that the program 
consists of two loops: 


1. The loop that consists of the entire if statement, labeled start. This loop is exited 
when x equals 0, at which point y is set to 1 and execution proceeds to the label 
label. 

2. The loop that begins at label /abel! and ends with the statement goto label1. This 
loop is exited when the stack has been emptied and underflow occurs, at which 
point a return is executed. 


These loops can easily be transformed into explicit while loops as follows: 


/* subtraction loop */ 
Start: while (x != 0) 
* push(@s, x--); 
y=]; 
popandtest(&s, &x, &und); 
label1: while (und == FALSE) { 
y *= x; 
popandtest (&s, &x, kund); 
} /* end while * 7 
return(y); 


Let us examine these two loops more closely. x starts off at the value of the input 
parameter n and is reduced by | each time that the subtraction loop is repeated. Each’ 
time x is set to a new value, the old value of x is saved on the stack. This continues 
until x is 0. Thus, after the first loop has been Executed the stack contains, from top to 
bottom, the integers 1 to n. 

The multiplication loop merely removes each of these values from the stack and 
sets y to the product of the popped value and the old value of v. Since we know what 
the stack contains at the start of the multiplication loop, why bother popping the stack? 
We can use those values directly. We can eliminate the stack und the first loop entirely 
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and replace the multiplication loop with a loop that multiplies y by each of the integers 
from 1 to n in turn. The resulting program is ; 


int simfact(int n) . 
{ - i 

int x; 

long int y; 


for (y=x=1; x <= n; x++) 
y *= x; f 
return(y); 
} /* end simfact */ 


But this program is a direct C implementation of the iterative version of the factorial 
function as presented in Section 3.1. The only change is that x varies from | to n rather 
than from nto 1. 


Simulating the Towers of Hanoi 


We have shown that successive transformations of a nonrecursive simulation of 
a recursive routine may lead to a simpler program for solving a problem. Let us now 
look at a more complex example of recursion, the Towers of Hanoi problem presented 
in Section 3.3. We will simulate its recursion and attempt to simplify the simulation 
to produce a nonrecursive solution. We present again the recursive program of Section 


335 
void towers(int n, char frompeg, char topeg, char auxpeg) 


/* If only one disk, make the move and return. */ 
if (n == 1) { 
printf("\n%s%c%s%c", "move disk 1 from peg '", frompeg, 
"to peg ", topeg); 
return; 
} /* end if */ 
/* Move top n-1 disks from A to B, using C as */ 


/* auxiliary a/ 
towers(n-1, frompeg, auxpeg, topeg); 
j/* Move remaining disk from A to C. 4 


printf("\n%sxd%s%cxs%c", "move disk ",-" from peg ", 
frompeg, " to peg ", topeg); 
/* Move n-1 disk from B to C using A as */ 
/* i auxiliary */ 
towers(n-1, auxpeg, topeg, frompeg); 
} /* end towers */ . 


Make sure that you understand the problem and the recursive solution before proceed- 


ing. If you do not, reread Section 3.3. 
There are four parameters in this function, each of which is subject to change in 


a recursive call. Therefore the data area must contain elements representing all four. 
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There are no local variables. There is a single temporary that is needed to hold the value 
of n — 1, but this can be represented by a similar temporary in the simulating program 
and does not have to be stacked. There are three possible points to which the function 
returns on various calls: the calling program and the two points following the recursive 
calls. Therefore four labels are necessary: 


area. 
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start: 

label1; 
label2: 
label13: 


The return address is encoded,as an integer (either 1, 2, or 3) within each data 


. 


Consider the following nonrecursive simulation of towers: 


struct dataarea { 


J; 


int nparam; 

char fromparam; 
char toparam; 
char auxparam; 
short int retaddr; 


struct stack { 


}; 


int top; 
struct dataarea item[MAXSTACK]; 


void simtowers(int n, char frompeg, char topeg, char auxpeg) 


{ 


struct stack s; 

struct dataarea currarea; 
char temp; 

short int i; 


s.top = -1; 
currarea.nparam = 0; 
currarea.fromparam = ' '; 
currarea.toparam = ' '; 
' 


currarea.auxparam = ' '; 
currarea.retaddr = 
/* Push dummy data area onto stack. */ 


push(&s, &currarea); 

/* Set the parameters and the return addresses */ 
/* of the current ee to their proper values. */ 
currarea.nparam = 

currarea. tl frompeg; ` 
currarea.toparam = topeg; 


` currarea.auxparam = auxpeg; 


currarea.retaddr = 1; 


E 
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start: /* This is the start of the simulated routine. */ 
if (currarea.nparam == 1) { 
printf("\nxsxcxs%c", "move disk 1 from peg ", 
currarea.frompeg, " to peg ", currarea.toparam) 
i = currarea.retaddr; 
pop(&s, &currarea); 
switch(i) { 
case 1: goto label; 
case 2: goto label2; 
case 3: goto label3; 
} /* end switch */ 
} /* end if */ 
/* This is the first recursive call. i 
push(&s, &currarea) ; ° 
--currarea.nparam; 
temp = currarea.auxparam; 
currarea.auxparam = currarea.toparam; 
currarea.toparam = temp; 
currarea.retaddr = 2; 
goto start; 
label2; /* We return to this point from the first */ 
= recursive call. */ 
printf("\n%sxd%s%cks%c", “move disk ", currarea.nparam, " from peg ", 
currarea.fromparam, " to peg ", currarea.toparam, 
j* This is the second recursive call. */ 
push(&s, &currarea); 
--currarea.nparam; 
temp = currarea.fromparam; 
currarea.fromparam = currarea.auxparam; 
currarea.auxparam = temp; 
currarea.retaddr = 3; 


goto start; 
label3: /* Return to this point from the second */ 
/* recursive call. 5 Ei 


“i = currarea.retaddr; 

pop(&s, &currarea); 
switch(i) { 

, case 1: goto label1; 
case 2: goto label2; 
case 3: goto label3; 

} /* end switch */ 

Jabell:. return; 
} /* end simtowers */ 


We now simplify the program. First, notice that, three labels are used for return 
addresses: one for each of the two recursive calls and one for the return to the main 
program. However, the return to the main program can be signaled by an underflow in 
the stack, exactly as in the second version of simfact. This leaves two return labels. If we 
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could eliminate one more such label it would no longer be necessary to stack the return 
address, since there would be only one point remaining to which control may be passed 


if the stack is popped Successfully. We focus our attention on the second recursive call 
and the statement 


towers(n-1, auxpeg, topeg, frompeg); 


The actions that occur in simulating this call are as follows: 


1. Push the current data area, al, onto the stack. 


2. Set the parameters in the new current data area, a2, to their respective values, 
n — 1, auxpeg, topeg, and frompeg. , 

3. Set the return label in the current data area, a2, to the address of the statement 
immediately following the call, 


4. Branch to the beginning of the simulated routine. 


After the simulated routine has completed, it is ready to return. The following 
actions occur: 


5. Save the return label, /, from the current data area, a2. 


6. Pop the stack and set the current data area to the popped data area, al. 
7. Branch to /. 


But / is the label of the end of the block of code, since the secon call to towers 
appears as the last statement of the function. Thus, the next step is to pop the stack again 
and return once more. We never again make use of the information in the current- data 
area al, since it is immediately destroyed by popping the stack as soon as it has been 
restored. Since there is no reason to use this data area again, there is no reason to save 
it on the stack in simulating the call. Data need be saved on the stack only if it is to be 
reused. Therefore the second call to towers may be simulated simply by 


1. Changing the parameters in the current data area to their respective values 
2. Branching to the beginning of the simulated routine 


When the simulated routine returns it can return directly to the routine that called 

- the current version. There is no reason to execute a return to the current version, only 

to return immediately to the previous version. Thus we have eliminated the need for 

stacking the return address in simulating the external call (since it can be signaled by 

underflow) and in simulating the second recursive call (since there is no need to save and 

restore the calling routine’s data area at that point). The only remaining return address 
is the one following the first recursive call. 

Since there is only one possible return address left, it is unnecessary to keep it in 
the data area, to be pushed and popped with the rest of the data. Whenever the stack is ` 
popped successfully, there is only one address to which a branch can be executed: the 
Statement following the first call. If an underflow is encountered, the routine returns to 
the calling routine. Since the new values of the variables in the current data area will 
be obtained from the old values in the current data area, it is necessary to declare an 
additional variable, temp, so that values can be interchanged, 
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A revised nonrecursive simulation of towers follows: 


start: 


retaddr: 
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struct dataarea { 
int nparam; 
char fromparam; 
char toparam; 
char auxparam; 
} 
struct stack { 
int top; 
struct dataarea item[MAXSTACK] ; 
} 


void simtowers(int n, char frompeg, char topeg, char auxpeg) 
{ 

struct stack sS; 

struct dataarea currarea; 

short int und; 

char temp; 


s.top = -1; 
currarea.nparam = n; 
currarea.fromparam = frompeg; 
currarea.toparam = topeg; 
currarea.auxparam = auxpeg; 
/* This is the start of the simulated routine. */ 
if (currarea.nparam == 1) { 
printf("\n%s%c%skc"", “move disk 1 from peg " 
currarea.frompeg, " to pég ", currarea. -toparan) 
simulate the return */ 
popandtest(&s, &currarea, &und); 
if (und == TRUE) 
return; 
goto retaddr; 
/* end if */ 
{* simulate the first recursive call £/ 
push(&s, &currarea) ; 
--currarea.nparam; 
temp = currarea.toparam; 
currarea.toparam = currarea.auxparam, 
currarea.auxparam = temp; 


/* 


goto start; 
/* return to this point from the first t7 
/* recursive call f 


printf(" \n%s%džs%c%s%c" “move disk ", currarea. nparam, 
" from peg ", currarea.fromparam, " to peg " 
currarea.toparam) ; 
/* simulation of second recursive call */ 
--currarea.nparam; 
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temp = currarea.fromparam; 
currarea.fromparam = currarea.auxparam; 
currarea.auxparam = temp; 
goto start; 

} /* end simtowers */ 


Examining the structure of the program, we see that it can easily be reorganized 
‘into a simpler format. We begin from the label start. 


while (TRUE) {- ' 
while (currarea.nparam != 1) { 
push(&s, &currarea); 
--currarea.nparam; 
temp = currarea.toparam; 
currarea.toparam = Currarea.auxparam; 
currarea.auxparam = temp; 
} /* end while */ ce 
printf("\n%s%c%s%c", "move disk 1 from peg ", 
currarea.fromparam, " to peg ", currarea.toparam) 
popandtest(&s, &currarea, &und): 
if (und == TRUE) 
return; š 
printf ("\n%s%d%s%c%s%c", "move disk ", currarea.nparam, 
" from ", currarea.fromparam, " to peg ", 
currarea. toparam) 
--currarea.nparam; 
temp = currarea.fromparam; 
currarea.fromparam = currarea.auxparam; 
currarea.auxparam = temp; 
} /* end while */ 


Trace through the actions of this program and see how it reflects the actions of the 
original recursive version. 


EXERCISES . 


3.4.1. Write a nunrecursive simulation of the functions convert and find presented in Section 
3.3: 


3.4.2. ` Write a nonrecursive simulation of the recursive binary search procedure, and transform 
it into an iterative procedure. 

3.4.3, Write a nonrecursive simulation of fib. Can you transform it into an iterative method? 

3.4.4. Write nonrecursive simulations of the recursive routines of Sections 3.2 and 3.3 and the 

. exercises of those sections. 

3.4.5. Show that any solution to the Towers of Hanoi problem that uses a minimum number 
of moves must satisfy the conditions listed below. Use these facts to develop a direct 
‘iterative algorithm for Towers of Hanoi. Implement the algorithm as a C program. 
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+ 1. The first move involves moving the smallest disk. 


A minimum-move solution consists of alternately pees the smallest disk and a 
disk that is not the smallest. 


3. Atany point, there is only one possible move riveting a disk that is not the smallest. 


4. Define the cyclic direction from frompeg to topeg to auxpeg to frompeg as clock- 
wise and the opposite ¢ direction (from frompeg to auxpeg to topeg to frompeg) as 
counterclockwise. Assume that a minimum-move solution to move a k-disk tower 
from frompeg to topeg always moves the smallest disk in one direction. Show that 
a,minimum:move solution to move a (k + 1)-disk tower from jrompeg to topeg 
would then always move the smallest disk in the other direction. Since the solution 
for one disk moves the smallest disk clockwise (the single move from frompeg to ` 
topeg), this means that for an odd number of disks the smallest disk always moves 
clockwise, and for an even number of disks the smallest disk always moves coùn- 
terclockwise. 

5. The solution is completed as soon as all the disks are on a single peg. 


3.4.6. Convert the following recursive program scheme into an iterative version that does not 
use a stack. fin) is a function that returns TRUE or FALSE based on the value of n, and 
g(n) is a function that returns a value of the same type as n (without modifying n). 


int rec(int n) 


if (f(n) == FALSE) { 
/* any group of C statements that */ 
/* do not change the value of n +*/ 
rec(g(n)); 
} /* end if */ 
} /* end rec */ 


Generalize your result to the case in which rec returns a value. -* 


3.4.7. Let f(n) be a function and g(n) and h(n) be functions that return a value of the same type 
as n without modifying n. Let (stmts) represent any group of C statements that do not 


modify the value of n. Show that the recursive program scheme rec is equivalent to the 
iterative scheme iter: 


void rec(int n) 


if (f(n) == FALSE) { 
(stmts); 
' rec(g(n)); 
rec(h(n)); 
_} ¿end if */ 
} /* end rec */ 


struct stack { 

int top; 

int nvalues[MAXSTACK]; 
B. 
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void iter(int n) 
Struct stack s; 


S.top = -1; 
push(&s, n); 
while(empty(&) == FALSE ) { 
n = pop(&s); 
if (f(n) == FALSE) { 
(stmts); 
push(&s, h(n)); 
push(&s, g(n)); 
} /* end if */ ` 
} /* end while */ 
} /* end iter */ 


Show that the if statements in iter can be replaced by the following loop: 


while (f(n) == FALSE) { 
(stmts) 
push(&s, ~ ; 
n= eo 

} /* end wiile */ 


3.5 EFFICIENCY OF RECURSION 


In general a nonrecursive version of a program will execute more efficiently in terms 
of time and space than a recursive version. This is because the overhead involved in 
entering and exiting a block is avoided in the nonrecursive version. As we have seen, 
it is often possible to identity a good number of local variables and temporaries that do 
not have to be saved and restored through the use of a stack. In a nonrecursive program 
this needless stacking activity can be eliminated. However, in a recursive procedure, 
the compiler is usually unable to identify-such variables, and they are therefore stacked 
and unstacked to ensure that no problems arise. 

However. we have also seen that sometimes a recursive solution is the most nat- 
ural and logical way of solving a problem. It is doubtful whether a programmer could 
have developed the nonrecursive solution to the Towers of Hanoi problem directly from 
the problem statement. A similar comment may be made about the problem of convert- 
ing prefix to postfix, where the recursive solution flows directly from the definitions. A 
nonrecursive solution involving stacks is more difficult to develop and more prone to 
error. i 

Thus there is a conflict between machine efficiency and programmer efficiency, 
With the cost of programming increasing steadily, and the cost of computation decreas- 
ing. we have reached the point where in most cases it is not worth a programmer's 
time to laboriously construct a nonrecursive solution to a problem that is most naturally 
solved recursively. Of course an incompetent, overly clever programmer may come up 
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with a complicated recursive solution to a simple problem that.can be solved directly by 
nonrecursive methods. (An example of this is the factorial function or even the binary 
search.) However, if a competent programmer identifies a recursive solution as being 
the simplest and most straightforward method for solving a particular problem, it is 
probably not worth the time and effort to discover a more efficient method. 

However, this is not always the case. If a program is to be run very frequently 
(often entire computers are dedicated to continually running the same program), so that 
increased efficiency in execution speed significantly increases throughput, the extra 
investment in programming time is worthwhile. Even in such cases, it is probably better 
to create a nonrecursive version by simulating and transforming the recursive solution 
than by attempting to create a nonrecursive solution from the problem statement. 

To do this most efficiently, what is required is to first write the recursive routine 
and then its simulated version, including all stacks and temporaries. After this has been 
done, eliminate all stacks and variables that are superfluous. The final version is a refine- 
ment of the original program and is certainly more efficient. Clearly, the elimination of 
each superfluous and redundant operation improves the efficiency of the resulting pro- 
gram. However, every transformation applied to a program is another opening through 
which an unanticipated error may creep in. 

When a stack cannot be eliminated from the nonrecursive version of a program 
and when the recursive version does not contain any extra parameters or local vari- 
ables, the recursive version can be as fast or faster than the nonrecursive version under 
a good compiler. The Towers of Hanoi is an example of such a recursive program. Fac- 
torial, whose nonrecursive version does not need a stack, and calculation of Fibonacci 
numbers, which contains an unnecessary second recursive call (and does not need a 
stack either), are examples where recursion should be avoided in a practical implemen- 
tation, We examine another example of efficient recursion (in order tree traversal) in 
Section 5.2. 

Another point to remember is that explicit calls to pop, push, and empty, as well as 
tests for underflow and overflow, are quite expensive. In fact, they can often outweigh 
the expense of the overhead of recursion. Thus, to maximize actual run-time efficiency 
of a nonrecursive translation, these calls should be replaced by in-line code and the 
overflow/underflow tests eliminated when it is known that we are operating within the 
array bounds. 

The ideas and transformations that we have put forward in presenting the factorial 
function and in the Towers of Hanoi problem can be applied to more complex problems 
whose nonrecursive solution is not readily apparent. The extent to which.a recursive 
solution (actual or simulated) can be transformed into a direct solution depends on the 
particular problem and the ingenuity of the programmer. 


EXERCISES 


3.5.1. Run the recursive and nonrecursive versions of the factorial function of Sections 3.2 anc 
3.4, and examine how much space and time each requires as n becomes larger. 
3.5.2. Do the same as in Exercise 3.5.1 for the Towers of Hanoi problem. 
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T his chapter introduces he queue and the prior ity ueue, two impor tant data structures 
t 
q ’ 
ns. p k d 
often used to simulate real world Situations The concepts of the stack an queue are 


then extended to a ne , . 
w structure the list arious forms of lists and their associated 
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- 4.1 THE QUEUE AND ITS SEQUENTIAL REPRESENTATION 


A queue is an ordered collection of items from which items may be deleted at one end 
(called the front of the queue) and into which items may be inserted at the other end 
(called the rear of the queue). 

Figure 4.1.la illustrates a queue containing three elements A, B, and C. A is at 
ihe front of the queue and C is at the rear. In Figure 4.1 ‘1b an element has been deleted 
from the queue. Since elements may be deleted only from the front of the queue, A is 
removed and B is now at the front. In Figure 4.1.1c, when items D and E are inserted, 
they must be inserted at the rear of the queue. 

Since D was inserted into the queue before E, it will be removed earlier. The first 
element inserted into a queue is the first element to be removed. For this reason @ queue 
is sometimes called a fifo (first-in. first-out) list as opposed to a stack, which is 4 lifo 
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Figure 4.1.1 Queue 


(last-in, first-out) list. Examples of a queue abound in the real world. A line at a bank 
or at a bus stop and a group of cars waiting at a toll booth are all familiar examples of 
queues. 

Three primitive operations can be applied to a queue. The operation insert(q.x) 
inserts item x at the rear of the queue g. The operation x = remove(g) deletes the front 
element from the queue q and sets x to its contents. The third operation, empry(q), re- 
turns false or true depending on whether or not the queue contains any elements. The 
queue in Figure 4.1.1 can be obtained by the follawing sequence of operations. We 
assume that the queue is initially empty. 


insert(q, A); 
insert(q, 8); 


insert(q, QO; (Figure 4.1.1a) 

x = remove(q); (Figure 4.1.1b; x is set to A) 
insert(q, D); 

insert(q, E); (Figure 4.1.1c) 


The insert operation can always be performed, since there is no limit to the number 
of elements a queue may contain, The remove operation, however, can be applied only 
‘if the queue is nonempty; there is no way to remove an element from a queue containing 
no elements. The result of an illegal attempt to remove an element from an empty queue 
is called underflow. The empty operation is, of course, always applicable. 
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The Queue as an Abstract Data Type 


The representation of a queue as an abstract data type is straightforward. We, use 
eltype to denote the type of the queue element and parameterize the queue type with 


eltype. 
abstract typedef <<eltype>> QUEUE(el type); 


abstract empty(q) 
QUEVE(eltype) q; i 
postcondition empty == (len(q) == 0); 


abstract eltype remove(q) 
QUEVE(el type) q; 
precondition empty(q) == FALSE; 
postcondition remove == first(q'); 
q == sub(q', 1, Ten(q') - 1); 


abstract insert(q, elt) 
QUEUE(eltype) q; 

eltype elt; 

postcondition q == q' + <elt; 


C Implementation of Queues 


How shall a queue be represented in C? One idea is to use an array to hold the 
elements of the queue and to use two variables, front and rear, to hold the positions 
within the array of the first and last elements of the queue. We might declare.a queue q 
of integers by , 


#define MAXQUEUE 100 
struct queue { 
int items [MAXQUEUE]; 
int front, rear; 


» W 


Of course, using an array to hold a queue introduces the possibility of overflow 

` if the queue should grow larger than the size of the array. Ignoring the possibility of 

underflow and overflow for the moment, the operation insert(q, x) could be implemented 
by the. statements ma 


q.items [++q:rear] = x; 
and the operation x = remove(q) could be implemented by 


x = q.items [q.front++]; 
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Initially, q.rear is set to —1, and g.front-is set to 0. The queue is empty whenever 
q.rear < q.front. The number of elements in the queue at any time is equal to the value 
of q.rear — q.front + 1. e. 

Let us examine what might happen under this representation. Figure 4.1.2 illus- 
trates an array of five elements used to represent a queue (that is, MAXQUEUE equals 
5). Initially (Figure 4.1.2a), the queue is empty. In Figure 4.1.2b items A, B, and C have 
been inserted. In Figure 4.1.2c two items have been deleted, and in Figure 4.1.2d two 
new items, D and E, have been inserted. The value of g.front is 2, and the value of 
- q.rear is 4, so that there are only 4 — 2 + 1 = 3 elements in the queue. Since the array 
contains five elements, there should be room for the queue to expand without the worry 
of overfiow. ahi i! 

However, to insert F into the queue, g.rear must be increased by 1 to 5 and 
q.items(5] must be set to the value F. But g.items is an array of only five elements, 
so that the insertion cannot be made. It is possible to reach the absurd situation where 
the queue is empty, yet no new element can be inserted (see if you’can come up with 
a sequence of insertions and deletions to reach that situation). Clearly, the array repre- 
sentation outlined in the foregoing is unacceptable. à i 

One solution is to modify the remove operation so that when an item is deleted. 
the entire queue is shifted to the beginning of the array. The operation x = remove(q, 
would then be modified (again; ignoring the possibility of underflow) to 
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x = q.items[0]; 

for (i = 0; i < q.rear; i++) 
q.items[i] = q.items[i+1]}; 

q.rear--; +P- 


The queue need no longer contain a front field, since the element at position 0 of the 
array is always at the front of the queue. The empty queue is represented by the queue 
in which rear equals — 1. 

This method, however, is too inefficient. Each deletion involves moving every 
remaining element of the queue. If a queue contains 500 or 1000 elements, this is clearly 
too high a price to pay. Further, the operation of removing an element from a queue 
logically involves manipulation of only one element: the one currently at the front of the 
queue. The implementation of that operation should reflect this and should not involve 
a host of extraneous operations (see Exercise 4.1.3 for a more efficient alternative). 

Another solution is to view the array that holds the queue as a circle rather than 
as a straight line. That is, we imagine the first element of the array (that is, the element 
at position 0) as immediately following its last element. This implies that even if the 
last element is occupied, a new value can be inserted behind it in the first element of 
the array as long as that first element is empty. l 

Let us look at an example. Assume that a queue contains three items in positions 
2, 3, and 4 of a five element array. This is the situation of Figure 4.1.2d reproduced 
as Figure 4.1.3a. Although the array is not full, its last element is occupied. If item F 
is now inserted into the queue, it can be placed in position 0 of the array, as shown 
in Figure 4.1.3b. The first item of the queue is in g.items{2], which is followed in the 
queue by q.items{3], g.items{4] and q.items{0]. Figure 4.1.3c, d, and e,show the status 
of the queue as first two items C and D are deleted, then G is inserted, and finally E is 
deleted. , 

Unfortunately, it is difficult under this representation to determine when the queue 
is empty. The condition g.rear < q.front is no longer valid as a test for the empty queue, 
since Figure 4.1.3b, c, and d all illustrate situations in which the condition is true yet 
the queue is not empty. 

One way of solving this problem is to establish the convention that the value of 
q-front is the array index immediately preceding the first element of the queue rather 
than the index of the first element itself. Thus since g.rear is the index of the last element 
of the queue, the condition g.front == q.rear implies that the queue is empty. A queue 
of integers may therefore be declared and initialized by 


#define MAXQUEUVE 100. 
struct queue { 
int items [MAXQUEUE]; 
int front, rear; 
X 
struct queue q; 
q.front = q.rear = MAXQUEVE-1; 


7 


Note that q.front and q.rear are initialized to the last index of the array, rather 
than to —1 or 0, because the last element of the array immediately precedes the first one 
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within the queue under this representation. Since g.rear equals g.front, the queue is 
initially empty. 


-The empty function may be coded as 


int empty(struct queue *pq) 
{ 


return ((pq->front == pa- >rear) ? TRUE : FALSE); 
} /* end empty ff 


< 
Once this function exists, a test for the empty queue is implemented by the state- 
ment 
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if (empty(&q)) 

/* queue is empty */ 
else 

/* queue is not empty */ 


The operation remove(q) may be coded as 


int remove(struct queue *pq) 
{ 
if (empty(pq)) { 
printf("queue underflow"); 
exit(1); 
} /* end if */ i 
if (pq->front == MAXQUEUE-1) 
pq->front = 0; 
else 
(pq->front)++; 
return (pq->items(pq- >front]); 
} -/* end remove */ 


Note that pq is already a pointer to a structure of type queue, so-the address operator 


“&” is not used in calling empty within remove. Also note that pg—> front must be - 


updated before an element is extracted. 
Of course, often an underflow condition is meaningful and serves as a signal for 


a new phase of processing. We may wish to use a function remvandtest, whose header 
is A 


void remvandtest(struct queue *pq, int *px, int *pund) 


If the queue is nonempty, this routine sets *pund to FALSE and *px to the element 
removed from the queue. If the queue is empty, so that underflow occurs, the routine 
sets *pund to TRUE. The coding of the routine is left to the reader. 


insert Operation 


The insert operation involves testing for overflow, which occurs when the entire 
array is occupied by items of the queue and an attempt is made to insert yet another 
element into the queue. For example, consider the queue of Figure 4.1.4a. There are 
three elements in the queue: C, D, and E in g.items{2], g.items{3], and q.items[4], re- 
spettively. Since the last item of the queue occupies q.items|4], g.rear equals 4. Since 
the first element of the queue is in q.items{2], q.front equals 1. In Figure 4.1.4b and c, 
items F and G are inserted into the queue. At that point, the array is full and an attempt 
to perform any more insertions causes an overflow. But this is indicated by the fact 
that g.front equals g.rear, which is precisely the indication for underflow. It seems that 
there is no way to distinguish between the empty queue and the full queue under this 
implementation. Such a situation is clearly unsatisfactory. 
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Figure 4.1.4 


One solution is to sacritice one element of the array and to allow a queue to grow 
only as large as one less than the size of the array. Thus. if an array of 100 elements is 
declared as a queue, the queue may contain up to 99 elements. An attempt to insert a 


hundredth element into the queue causes an overflow. The insert routine may then be 
written as follows: 


void insert(struct queue *pq, int x) 
{ = 
/* make room for new element */ 
if (pq->rear == MAXQUEUE-1) 
pq->rear = 0; 
else 
(pg->rear)++; 
/* check for overflow. */ 
if (pq->rear == pq->front) { 
printf("queue overflow"); 
exit(1); 
} /* end if */ 
pq->items[pq->rear] = x; 
return; 
} /* end insert */ 


The test for overflow in insert occurs after pq— rear has been adjusted, whereas 


the test for underflow in remove occurs immediately upon entering the routine, before 
py front is updated. 
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Priority Queue 


Both the stack and the queue are data structures whose elements are ordered based 
on the sequence in which they have been inserted. The pop operation retrieves the last 
clement inserted, and the remove operation retrieves the first element inserted. If there 
is an intrinsic order among the elements themselves (for example, numeric order or 
alphabetic order), it is ignored in the stack or queue operations. 

The priority queue is a data structure in which the intrinsic ordering of the ele- 
ments does determine the results of its basic operations. There are two types of priority 
queues: an ascending priority queue and a descending Priority queue, An ascending 
priority queue is a collection of items into which items can be inserted arbitrarily and 
from which only the smallest item can be removed. If afiq is an ascending priority 
queue, the operation pqinsert(apq.x) inserts element x into apq and pqmindelete(apq) 
removes the minimum element from apg and returns its value. 

A descending priority queue is similar but allows deletion of only the largest 
item. The operations applicable to a descending priority queue. dpq, are pqinsert(dpq,x) 
and pqmaxdelete(dpq). pginsert(dpq.x) inserts element x into dpq and is logically iden- 
tical to pginsert for an ascending priority queue. pqmaxdelete(dpq) removes the maxi- 
mum element from dpq and returns its value. , i 

The operation empty(pq) applies to both types of priority queue and determines 
whether a priority queue is empty. pqmindelete or pgmaxdelete can only be applied to 
a nonempty priority queue [that is, if empry(pq) is FALSE). 

Once pqmindelete has been applied to retrieve the smallest element of an ascend- 
ing priority queue, it can be applied again to retrieve the next smallest, and so on. Thus 
the operation successively retrieves elements of a priority queue in ascending order. 
(However, if a small element is inserted after several deletions, the next retrieva! will 
return that small element, which may be smaller than a previously retrieved element.) 
Similarly, pgmaxdelete retrieves elements of a descending priority queue in descend- 
ing order. This explains the designation of a priority queue as either ascending or de- 
scending. , 

The elements of a priority queue need not be numbers or characters that can be 
compared directly. They may be complex structures that are ordered on one or sev- 
eral fields. For example, telephone-book listings consist of last names, first names, ad- 
dresses, and phone numbers and are ordered by last name. 

Sometimes the field on which the elements of a priority queue are ordered is 
not even part of the elements themselves; it may be a special, external value used 
specifically for the purpose of crdering the priority queue. For example, a stack may 
be viewed as a descending priority queue whose elements are ordered by time of in- 
sertidn. The element that was inserted last has the greatest insertion-time value and 
is the only item that can be retrieved. A queue may similarly be viewed as an as- 
cending priority queue whose elements are ordered by time of insertion. In both cases 
the time of insertion is not part of the elements themselves but is used to order the 
priority queue. 

We leave as an exercise for the reader the development of an ADT specitication 
for a priority queue. We now look at implementation considerations. ` 
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Array Implementation of a Priority Queue 


As we have seen, a stack and a queue can be implemented in an array so that 
each insertion or deletion involves accessing only a single element of the array. Unfor- 
tunately, this is not possible for a priority queue. ; 

Suppose that the n elements of a priority queue py are maintained in positions 0 
to n — | of an array pq.items of size maxpq, and suppose that pg.rear equals the first 


empty array position, n. Then pginsert(pq, x) would seem to be a fairly straightforward 
operation: 


if (pq.rear >= maxpq) { 
orintf("priority queue overflow"); 
exit(1); 

} /* end if */ 

pq. items({pq.rear] = x; 

pq.rear++; 


Note that under this insertion method the elements of the priority queue are not kept 
ordered in the array. 

As long as only insertions take place, this implementation works well. Suppose. 
however, that we attempt the operation pymindelete(pq) on an ascending priority queue. 
This raises two issues. First, to locate the smallest element, every element of the array 
from pq. items|0] through pq.items|pg.rear — 1| must be examined. Therefore a deletion 
requires accessing every element of the priority queue. ; 

Second, how can an element in the middle of the array be deleted? Stack and 
queue deletions involve removal of an item from one of the two ends and do not re- 
quire any searching. Priority queue deletion under this implementation requires both 
searching for the element to be deleted and removal of an elementin the middle of an 
array. 

There are severa! sojutions to this problem, none of them entirely satisfactory: 


1. A special “empty” indicator can be placed into a deleted position. This indica- 
tor can be a value that is invalid as an element (for example, —1 in a priority 
queue of nonnegative numbers), or a separate field can be contained in each array 
element to indicate whether it is empty. Insertion proceeds as befure, but when 
pq.rear reaches maxpq the array elements are compacted into the iront of the array 
and pq.rear is reset to one more than the number of elements. There are several 
disadvantages to this approach. First, the search process to locate the maximum 
or minimum element must examine all the deleted array positions in addition. 
to the actual priority queue elements. If many items have been deleted but no 
compaction has yet taken place, the deletion operation accesses many more ar- 
ray elements than exist in the priority queue. Second, once in a while insertion 
requires accessing every single position of the array, as it runs out of room and 
begins compaction. ; 

2. The deletion operation labels a position empty as in the previous solution, but in- 
sertion is Modified to insert a new item in the first “empty” position. Insertion then 
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involves accessing every array element up to the first one that has been deleted. 
This decreased efficiency of insertion is a major drawback to this solution. 


Each deletion can compact the array by shifting all elements past the deleted 


ww 


element by one position and then decrefnenting Pq.rear by 1. Insertion remains 
unchanged. On the average, half of all priority queue elements are shifted for each 
deletion, so that deletion beeomes quite inefficient. A slightly better alternative 
is to shift either all preceding elements forward or all succeeding elements back- 
ward, depepding on which group is smaller. This would require maintaining both 
front and rear indicators and treating the array as a circular structure, as we did 
for the queue. : > 


> 


Instead of maintaining the priority queue as an unordered array, maintain it as an 
ordered, circular array as follows: 


#define MAXPQ 100 
struct pqueue{ 
` int items [MAXPQ]; 
int front, rear; 
uw} i 


Struct pqueue pq; 


pq-front is the position of the smallest element. pq-rear is „l greater than the posi- 
tion of the largest. Deletion involves merely inereasing py.front (for the ascend- 
ing queue) or decreasing pq.rear (for a descending queue). However, insertion 
requires locating the proper position of the new element and shifting the preced- 
ing or succeeding elements (again, the technique of shifting whichever group is 
smaller is helpful). This method moves the work of searching and shifting from 
the deletion operation to the insertion operation. However, since the array is or- 
dered, the search for the position of the new element in an ordered array is only 
half as expensive on the average as finding the maximum or minimum of the un- 
ordered array, and a binary search might be used to reduce the cost even more. 
_ Other techniques. that involve leaving gaps in the array between elements of the 
priority queue to allow for subsequent insertions are also possible. 


We leave the C implementations of pginsert, pqmindelete, and pymaxdelete tor 


the array representation of a priority queue as exercises for the reader, Seurching ordered 
and unordered arrays is discussed further in Section 7.1. In general. using an array is not 
an efficient method for implementing a priority queue. More efficient implementations 
ure examined in the next section and in Sections 6.3 and 7.3. 


EXERCISES 


4.1.1. Write the function remvandtest(pq, px, pund) which sets *pund to FALSE and «px to 
the item removed from a nonempty queue *pq and sets *pund to TRUE if the queue is 
empty. 
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4.1.2. 


4.13. 


4.1.4. 
4.1.5. 


4.1.6. 


4.1.7. 


4.1.8. 


4.1.9. 


4.1.10. 


4.1.11. 


Exercises 


What set of conditions is necessary and sufficient for a sequence of insert and remove 
operations on a single empty queue fo leave the queue empty without causing under- 
flow? What set of conditions is necessary and sufficient for such a sequence to leave a 
nonempty queue unchanged? dmg 

If an array holding a queue is not considered circular, the text suggests that each remove 
operation must shift down every remaining element of a queue. An alternative method 
is to postpone shifting until rear equals the Jast index of the array. When that situation, 
occurs and an attempt is made to insert an element into the queue, the entire queue is 
shifted down, so that the first element of the queue is in position 0 of the array. What 
are the advantages of this method over performing a shift at each remove operation? 


, What are the disadvantages? Rewrite the routines remove, insert, and empty using this 
method. f ` i 


Show how a sequence of insertions and removals froma queue represented by a linear 


array Can cause overflow to occur upon an attempt to insert an element into an empty 
queue. 


We can avoid sacrificing one element of a queue if a field gempty is added to the queue 
representation. Show how this can be done ‘and rewrite the queue manipulation routines 
under that representation. ` 

How would you implement a queue of stacks? A stack of queues? A queue of queues? 
Write routines to implement the appropriate operations for each ofthese data structures. 
Show how to implement a queue of integers in C by:using an array.gueue[ 100}, where 
queue|(), is used to indicate the front of the queue, gueue[1] is used to indicate its rear, 
and queue{2] through queue[99) are used to contain the queue elements. Show how to 
initialize such an array to represent the empty queue and write routines remove, insert 
and empty for such an implementation. : 


Show how to implement a queue in C in which each item consists of a variable number 
of integers. 7 

A deque is an ordered set.of items from which items may be deleted at either end and 
into which items may be inserted at either end. Call the two ends of a deque left and 
right. How can a deque be represented as a C array? Write four C routines, 


remvleft, remvright, insrtleft, insrtright 


to remove and insert elements at the left and right ends of a deque. Make sure that the 
routines work properly for the empty deque and that they detect overflow and under- 
flow. 

Define an input-restricted deque as a deque (see Exercise 4.1.9) for which only the 
operations remvieft, remvright, and insrtleft are valid, and an output-restricted deque 
as a deque for which only the operations remvleft, insrtleft, and insrtright are valid. 
Show how each of these can be used to represent both a stack and a queue. 

The Scratchemup Parking Garage contains a single lane that holds up to ten cars. Cars 
arrive at the south end of the garage and leave from the north end. If a customer arrives 
to pick up a car that is not the northernmost, all cai’s to the north of the car are moved 
out. the car is driver out, and the other cars are restored in the same order that they 
were in originaliy, Whenever acar leaves. all cars to the south are moved forward 
so that at all times all the empty spaces are in the south part of the garage. Write a 
program that reads a group of input lines. Each line contains an ʻA’ for arrival or a 
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‘D’ fpr departure, and a license plate number. Car#hre assumed to arrive and depart in 
the order specified by the input. The program should print a message each time that a 
car arrives or departs. When a car arrives, the message should specify whether or not 
there is room for the car in the garage. If there is no room for a car, the car waits until 
there is room or until a departure line is read for the car. When room becomes available, 
another message should be printed. When a car departs, the message should include the 
number of times the car was moved within the garage, including the departure itself 
but not the arrival. This number is 0 if the car departs from the waiting line. 

4.1.12. Develop an ADT specification for a priority queue. 

4.1.13, Implement an ascending priority queue and its operations, pginsert, pqmindelete, and 
empty, using each of the four methods presented in the text. 

4.1.14. Show how to sort a set of input numbers using a priority queue and the operations 
pqinsert, pqmindelete, and empty. ; 


4.1.15. Implement a C++ class for a queue using the sequential representation. 


4.2 LINKED LISTS 


What are the drawbacks of using sequential sterage to represent stacks and queues? 
One major drawback is that a fixed amount of storage remains allocated to the stack 
or queue even when the structure is actually using a smaller amount or possibly ro 
storage at all. Further, no more than that fix2d amount of storage may be allocated, thus 
introducing the possibility of overflow. 

Assume that a program uses two stacks implemented in two separate arrays, 
sl.items anc s2.items, Further, assume that each of these arrays has 100 elements. Then 
despite the fact that 200 elements are available for the two stacks, neither can grow be- 
yond 100 ite ms. Even if the first stack contains only 25 items, the second cannot contain 
more than 100. 

One solution to this problem is to allocate a single array items of 200 elements. 
The first stack occupies items{O], items{1],..., items{top1], while the second stack 
is allocatec from the other end of the array; occynying_items[199], items[{198],..., 
items!: 2i. Thus when one stack is not occupying: orage the other stack can use that 
storage. w course, two distinct sets of pop, push, and empty routines are necessary 
for the two stacks. since one grows by incrementing topl, while the other grows by 
decrementing top2. 

Unfortunately, although such a scheme allows two stacks to share a common area, 
no such simple solution exists for three or more stacks or even for two queues. Instead, 
one must keep track of the tops and bottoms (or fronts and rears) of all the structures 
sharing a single large array, Each time that the growth of one structure is about to im- 
pinge on the storage currently being used by another, neighboring structures must be 
shifted within the single array to allow for the growth. 

Ina sequential representation. the items of a stack or queue are implicitly ordered 
by the sequential ordér of storage. Thus. if g.items|x] represents an element of a queue, 
the next element will be g.items|x + 1] (or if x equals MAXQUEUE — 1, g.items[0}). 
Suppose that the items of a stack or a queue weré explicitly ordered, that is, each item 
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Figure 4.2.1 Linear linked list. 


contained within itself the address of the next item. Such an explicit ordering gives 
rise to a data structure pictured in Figure 4.2.1, which is known as a linear linked list. 
Each itent in the list is called a node and contains two fields. an information field and a 
next address field. The information field holds the actual element on the list. The next 
address field contains the address of the next node in the list. Such an address, which is 
used to access a particular node, is known as a pointer. The entire linked list is accessed 
from an external pointer list that points to (contains the address of) the first node in the 
list. (By an “external” pointer, we mean one that is not included within a node. Rather 
its value can be accessed directly by referencing a variable.) The next address field of 
the last node in the list contains a special value, known as null, which is not a valid 
address. This null pointer is used to signal the end of a list. 

The list with no nodes on it is called the empty list or the null list. The value of 
the external pointer List to such a list is the null pointer, A list can be initialized to the 
empty list by the operation list = null. 

We now introduce some notation for use in algorithms (but not in C programs). 
If p is a pointer to a node, node(p) refers to the node pointed to by p, info(p) refers to 
the information portion of that node, and next(p) refers to the next address portion and 
is therefore a pointer. Thus, if next(p) is not null, info(next(p)) refers to the information 
portion of the node that follows node(p) in the list. 

Before proceeding with further discussion of linked lists, we should mention that 
we are presenting them primarily as a data structure (that is, an implementation method) 
rather than as a data type (that is, a logical structure with precisely defined primitive 
operations). We therefore do not present an ADT specification for linked lists here. In 
Section 9.1 we discuss lists as abstract structures and present some primitive operations 
for them. 

In this section, we present the concept of a linked list and show how it is used. In 
the next section, we show how linked lists can be implemented in C. 


Inserting and Removing Nodes from a List 


A list is a dynamic data structure. The number of nodes on a list may vary dra- 

- matically as elements are inserted and removed. The dynamic nature of a list may be 
contrasted with the static nature of an array, whose size remains constant. 

For example, suppose that we are given a list of integers, as illustrated in Figure 

4.2.2a, and we desire to add the integer 6 to the front of that list. That is, we wish to 
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Figure 4.2.2 Adding an element to the front of a list. 


change the list so that it appears as in Figure 4.2.2f. The first step is to obtain a node 
in which to house the additional integer. If a list is to grow and shrink, there must be 
some mechanism for obtaining empty nodes to be added onto the list. Note that, unlike 
an array, a list does not come with a presupplied set of storage locations into which 
elements can be placed. 

Let us assume the existence of a mechanism for obtaining empty nodes. The op- 
eration 


p = getnode(); 
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obtains an empty node and sets the contents of a variable named p to the address of 
that node. The value of p is then a pointer to this newly allocated node. Figure 4.2.2 
illustrates the list and the new node after performing the getnode operation. The details 
of how this operation works will be explained shortly. 


The next step is to insert the integer 6 into the infó, portion of the newly allocated 
node. This is done by the operation 


info(p).= 6; 


The result of this operation is illustrated in Figure 4.2.2c. 

After setting the info portion of node(p), it is necessary to set the next portion of 
that node. Since node(p) is to be inserted at the front of the list, the node that follows 
should be the current first node on the list. Since the variable list contains the address 
of that first node, node(p) can be added to the list by performing the operation 


next(p) = list; 


This operation places the value of list (which is the address of the first node on the list) 
into the next field of node(p). Figure 4.2.2d illustrates the result of this operation. 

At this point, p points to the list with the additional item included. However, since 
list is the external pointer to the desired list, its value must be modified to the address 
of the new first node of the list. This can be done by performing the operation 


list = p; 


which changes the value of list to the value of p. Figure 4.2.2e illustrates the result of 
this operation. Note that Figure 4.2.2e and f are identical except that the value of p is 
not shown in Figure 4.2.2f. This is because p is used as an auxiliary variable during 
the process of modifying the list but its value is irrelevant to the status of the list before 
and after the process. Once the foregoing operations have been performed, the value of 
p may be changed without affecting the list. 


Putting all the steps together, we have an algorithm for adding the integer 6 to the 
front of the list list: 


p = getnode(); 
info(p) = 6; 
next(p) = list; 
list = p; 


The algorithm can obviously be generalized so that it adds any object x to the front 
of a list list by replacing the operation info(p) = 6 with info(p) = x. Convince your- 
self that the algorithm works correctly, even if the list is initially empty 
(list == null), 

Figure 4.2.3 illustrates the process of removing the first node of a nonempty list 
and storing the value of its info field into a variable x. The initial configuration is shown 
in Figure 4.2.3a, and the final configuration is shown in Figure 4.2.3f. The process itself 
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Figure 4.2.3 Removing a node from the front of a list. 


is almost the exact opposite of the process to add a node to the front of a list. To obtain 
Figure 4.2.3d from Figure 4.2.3a, the following operations (whose actions should be 
clear) are performed: 


p = list; (Figure 4.2.3b) 
list = next(p); (Figure 4,2.3c) 
x = info(p); (Figure 4.2.3d) 
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At this point, the algorithm has accomplished what it was supposed to do: the 
first node has been removed from list, and x has been set to the desired value. However. 
the algorithm is not yet complete. In Figure 4.2.3d, JP still points to the node that was 
formerly first on the list. However, that node is currently useless because it 1s no longer 
on the list and its information has been stored in x. (The node is not-considered to be on 
the list despite the fact that nexr(p) points to a node on the list, since there is no way to 
reach node(p) from the external pointer list.) 

The variable p is used as an auxiliary variable during the process of removing 
the first node from the list. The starting and ending configurations of the Jist make no 
reference to p. It is therefore reasonable to expect that p will be used for some other 
purpose in a short while after this operation has been performed. But once the value of 
p is changed there is no way to access the node at all, since neither an external pointer 
nor a next field contains its address. Therefore the node is currently useless and cannot 
be reused, yet it is taking up valuable storage. 

It would be desirable to have some mechanism for making nodeip) available for 
reuse even if the value of the pointer p is changed. The operation that does this is 


freenode(p); (Figure 4.2.3e) . 


Once this operation has been performed, it becomes illegal to reference node(p). since 
the node is no longer allocated. Since the value of p is a pointer to a node that has been 
freed, any reference to that value is also illegal. 

However, the node might be reallocated and a pointer to it reassigned to p by 
the operation p = getnode(). Note that we say that the node “might be” reallocated, 
since the getnode operation returns a pointer to some newly allocated node. There is no 
guarantee that this new node is the same as the one that has just been freed. 

Another way of thinking of getnode and Freenode is that getnode creates a new 
node, whereas freenode destroys a node. Under this view, nodes are not used and 
reused but are rather created and destroyed. We shall say more about the two operations 
getnode and freenode and about the concepts they represent in a moment, but first we 
make the following interesting observation. 


Linked Implementation of Stacks 


The operation of adding an element to the front of a linked list is quite similar 
to that of pushing an element onto a stack. In both cases, a new item is added as the 
only immediately accessible item in a collection. A stack can be accessed only through 
its top element, and a list can be accessed only from the pointer to its first element. 
Similarly, the operation of removing the first element from a linked list is analogous to 
popping a stack. In both cases the only immediately accessible item of a collection is 
removed from that collection, and the next item becomes immediately accessible. 

Thus we have discovered another way of implementing a stack. A stack may 
be represented by a linear linked list. The first node of the list is the top of the stack. 


If an external pointer s points to such a linked list, the operation push(s,x) may be 
implemented by 
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p = getnode(); 
info(p) = x; 
next(p) = s; 
$= Di 
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The operation empty(s) is merely a test of whater s equats null, The operation 
x = pop(s) removes the first node from a nonempty list and signals underflow if the list 
is empty: 


if (empty(s)) { 
printf(' stack underflow’ )3 
exit(1); 
x 
else { 
P= s; 
s = next(p); 
x = info(p); 
freenode(p); 
} /* end if */ 


Figure 4.2.4a illustrates a stack implemented as a linked list, and Figure 4.2.4b illus- 
trates the same stack after another element has been pushed onto it. 


id) 


Figure 4.2.4 Stack and queue as linked lists. 
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i . The EER of the'list Pa eee a of stacks is that all acs being used by 

a program can share the’ same available list. When any stack needs a node, it can obtain 
it from the single available list. When any stack no longer needs a node, it returns the 
node to that-same available list. As long as the total amount of’space needed by all the. 

` stacks at any one‘time is less ‘than the amount of space initially available to them all; 

" each stack is able to grow and shrink to any size, No space has been preallocated to any 
-single stack and no stack is using space that it does not need. Furthermore, other data 

` stfuctures such as queued may also share the same set of nodes. 


3 getnode and freenode Operations . 

We now retum to a discussion of the getnode and freenode operations. In an ab- 
Stract, idealized world it is possible to postulate an infinite number of unused nodes 
available for use by abstract algorithms. The getnode operation finds one such node 
and makes it available to the algorithm. Alternatively, the getnode operation may be re- 
garded as a machine that manufactures nodes and never breaks down. Thus, each time 

that getnode is invoked, it presents its caller with a brand new node, different from all 
the nodes previously in use. ; 

In such an ideal world, the freenode operation would bé unnecessary to make 

- a node available for reuse. Why use an old second-hand node when a simple call to 

getnode car produce a new, never-before-used node? The only harm that an unused 
"node can do is to reduce the number of nodes that can possibly be used, but if an infinite 
-supply of nodes is available, such a reduction is meaningless. Therefore there is no 
` redson:to reuse a node. 

Unfortunately, we live in a real world. Computers do not have an infinite amount 
of storage and cannot manufacture more storage for immediate utilization (at least. not 
yet). Therefore there are a finite number of nodes available and it is impossible to use 

` more than that number at any given instant. If it is desired to use more than that number 
over a given period of time, some nodes must be reused. The function of fregnode is to 
make a node that is no longer being used i in its current context available for reuse in a 
different context. 

_ We might think of a finite pool of empty nodés, existing initially: This pool 

cannot be accessed by the programmer except through the getnode and freenode op- 

‘erations. gemnode removes a node from the pool, whereas freenode’ returns a node 
to the pool. Since any unused node is as good as any ‘other, it makes no differ- 
ence which node is retrieved by gémode or where within the pool a node is placed 
by freenode. 

; The most natural form for this pool to take is that of a linked list acting asa stack. 
The list is linked together by the next field in each node. The geinode operation removes 
the first node from this list and makes it available for use. The freenode operation adds 
a node to the front of the list, making it available for reallocation by. the next gemode. 

. The list of available nodes is called the available list: 

What happens when the available list is empty? This means that all nodes are 
currently in use and it is impossible to allocate any more. If a program calls on gemode 
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. e » « 7 
` when the available list is empty, the amount of storage assigned for that program's data 
- Structures is too Small. Therefore; overflow occurs, This is similar to the situation of a 
‘stack implemented in an array Overflowing the array bounds. 

As long as data structures are abstract, theoretical concepts in a world of infinite 
“space,-there is no possibility of overflow. It is only when they are implemented as real 
objects in a finite 2 sa that the possibility of overflow arises. 

Assume that an external pointer avail points to a list of available nodes. Then the 
bperation `- Pow l 


p = getnode(); 


is implemented as follows: eo 


if (avail == null) { 
* printf("overflow"); 
exit(l); 
} ; 
P= avail; 
avail = next(avail); 


` Sinte the possibility of overflow is accounted for in the getnude operation, it need 
not be mentioned in the list implementation of push. If a stack is about to overflow all 
available nodes, the statement p = getmode() within the push operation results in an 
overflow. i 


The implementation of freenode(p) is straightforward: 


next(p) = avail; 
avail = p; 


Linked implementation of Queues Š 


‘Let us now examine how to represent a queue as a linked list. Recall that items 
are deleted from the front of a queue and inserted at the rear. Let a pointer to the first 
eleypent of a list represent the front of the queue. Another pointer to the last element 
of the list represents the rear of the queue, as shown in Figure 4.2.4c. Figure 4.2.4d 
illustrates the same queue after a new item has been inserted, 

Under the list representation, a queue g consists of a list and two pointers, g.front 
and q.rear. The operations empty(q) and x = remove(q) are completely analogous to 
empty(s) and x = pop(s), with the pointer g.front replacing s. However. special attention’ 
is required when the last element is removed from a queue. In that case, g.rear must 

„also be set to null, since in an empty queue both 4.front and q.rear must be null. The 
algorithm for x = remove(q) is.therefore as follows: 
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iF (empty(q)) { 
printf("queue underflow"); 
exit(1); 

} 

p = q.front; 

x = info(p); 

q. front = next(p); 
if (q. front == nul!) 
q. rear = null; 
freenode(p); 

return(x); 


The operation insert(q, x) is implemented by 


p = getnode(); 
info(p) = x; 
next(p) = null; 
if (q.rear == null) 
q.front = p; 
else 
next(q.rear) = p; 
gq. rear = p; 


What are the disadvantages of representing a stack or queue by a linked list? 
Clearly, a node in a linked list occupies more storage than a corresponding element in 
an array, since two pieces of information per element are necessary in a list node (info 
and next), whereas only one piece of information is needed in the array implementation. 
However, the space used for a list node is usually not twice the space used by an array 
element, since the elements in such a list usually consist of structures with many sub- 
fields. For example, if each element on a stack were a structure occupying ten words. 
the addition of an eleventh word to contain a pointer increases the space requirement by 
only 10 percent. Further, it is sometimes possible to compress information and a pointer 
into a single word so that there is no space degradation. 

Another disadvantage is the additional time spent in managing the available list. 
Each addition and deletion of an element from a stack or a queue involves a correspond- 
ing deletion or addition to the available list. 

The advantage of using linked lists is that all the stacks and queues of a program 
have access to the same free list of nodes. Nodes not used by one stack may be used by 


another, as long as the total number of nodes in use at any one time is not greater than 
the total number of nodes available. 


Linked List as a Data Structure 


Linked lists are important not only as a means of implementing stacks and queues 
but as data structures in their own right. An item is accessed in a linked list by travers- 
ing the list from its beginning. An array implementation allows access to the nth item 
in a group using a single operation, whereas a list implementation requires n operations. 
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~ v Figure 4.2.5a 


It is necessary to pass through each of the first n — 1 elcments before reaching the 
nth element because there is no relation between the memory location occupied by an 
element of a list and its position within that list. 

The advantage of a list over an array occurs when it is necessary to insert or delete 
an clement in the middle of a group of other elements. For example, suppose that we 
wished to insert an element x between the third and fourth elements in an array of size 
10 that currently contains seven items (x[0] through x{6]). Items 6 through 3 «must first 
be moved one slot and the new element inserted in the newly available position 3. This 
process is illustrated by Figure 4.2.5a. In this case insertion of one item involves moving 
four items in addition to the insertion itself. If the array contained 500 or 1000 elements, 
a correspondingly larger number of elements would have to be moved. Similarly, to 
delete an element from an array without leaving a gap. all the-elements beyond the 
element delgted must be moved one position. 

On the other hand, suppose the items are stored as a list. If p points to an element 
of the list, inserting a new element after node(p) involves allocating a node. inserting the 
information, and adjusting two pointers. The amount of work required is independent 
of the size of the list. This is illustrated in Figure 4.2.5b. 

* Let insafter(p.x) denote the operation of inserting an item x into a list after a node 
pointed to by p. This operation is implemented as follows: 


q = getnode(); 
infocg) = 

rext(q) = next(p); 
nextip) = q; 
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An item can be inserted only after a given node, not before the node. This is 
because there is no way to Proceed from a given node to its predecessor in a linear list 
without traversing the list from its beginning. To insert an item before node(p), the next 
field of its predecessor must be changed to point to a newly allocated node. But, given p, 
there is no way to find that predecessor. ( However, it is possible to achieve the-effect of 
inserting an element before node(p) by inserting the element immediately after node(p) 
and then interchanging info(p) with the info field of the newly created successor. We 
leave the details for the reader.) , 

Similarly, to delete a node from a linear list it is insufficient to be given a pointer 
to that node. This is because the next field of the node’s-predecessor must be changed ` 
to point to the node's successor, and there is no direct way of reaching the predeces- 
sor of a given node. The best that can be done is to delete a node following a given 
node. (However, it is possible to save the contents of the following node, delete the 
following node, and then replace the contents of the given node with the saved infor- 
mation. This achieves the effect of deleting a given node unless the given node is last in 
the list.) oS 

Let delafter(p,x) denote the operation of deleting the node following node(p) and 
assigning its contents to the variable x. This operation may be implemented as follows: 


G = next(p); 
x = info(q); 
next(p) = next(q); 
freenode(q); 


` 


The freed node is placed onto the available list so that it may be reused in the future. 
Examples of List Operations 


We illustrate these two operations, as well as the push and pop operations for lists, 
with some simple examples. The first example is to delete all occurrences of the number 
4 from a list /ist. The list is traversed in a search for nodes that contain 4 in their info 
fields. Each such node must be deleted from the list. But to delete a node from a list, its 
predecessor must be known. For this reason two pointers, p and q, are used. p is used to 
traverse the list, and q always points to the predecessor of p. The algorithm makes use 
of the pop operation to remove nodes fiom the beginning of the list, and the delafier » 
operation to remove nodes from the middle of the list. 


2g = null; 
p = list; 
while (p != null) { 
if (info(p) == 4) 
if (q == null) { 
/* remove first node of the list */ 
x = pop( list); 
p = list; 
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else { a 
/* delete the node after q and move up p oat 
p = next(p); 
delafter(q, x); 
} /* end if */ 
else { S 
/* continue. traversing the list */ 
q= pi 
p = next(p); 
} /* end if */ 
} /* end while */ 


The practice of using two pointers, one following the other, is very common in 
working with lists. This technique istused in the next example as well. Assume that alist < 
list is ordered so that smaller.items precede larger ones. Such a list is called an ordered 
list. It is desired to insert an item x, into this list in its proper place. The algorithm to do 
so makes use of the push operation to add a nnde to the front of the list and the assaiter 
operation to add a node in the middle of the list: 


.q = null; ; 
for (p = list; p != null & x > info(p); p = next(p)) 
q=p; 
/* at this point, a node containing x must be inserted */ 3 
if (q == null) /* insert-x at the head of the list Si. hee ` 
push( list, x); ; 
else | 
insafter(q,. x); i 4 š ‘ 
This is a very common operation and will be denoted by place(list,'x). 

Let us examine the efficiency of the place operation. How many nodes are ac- 
cessed, on the average, in inserting a new element into an ordered list? Let us assume 
_ that the list contains 7 nodes. Then x can be placed in one of n + 1 positions; that is, it 

can be found to be less than the first element of the list, between the first and the sec> 
ond, ... between the (# — |)st.and the nth,-and:greater than the ath. If x is Jess than the 
first, place accesses only the first node of the list (aside from the new nede containing 
x); that is, it immediately determines that < info(/ist)-and inserts a’ ‘node containing i 
x using: push, If x is between the kth and (k + 1)st element, place accesses the first k 
nodes; only after finding x to be Jess than the contents of the (k + 1)st node is x inserted 
using insafter; If x is greater than the nth element, then all n nodes are’ accessed. 

- Now suppose that it is equally likely that x is inserted into any one of the: | 
n+ 1 possible positions, (If this is true, we say that the insertion is random.) Then. 
„the probability.of inserting at any particular position is In + 1). If the element is . 
inserted between the kth and the (k + 1)st position, the number of accesses isk+ 1.1 - 
the: element is inserted after the nth element, the number of accesses is n. The average - 
number of nodes accessed, A, equals the sum, over all possible insertion positions, of 
the products of the probability of inserting at a particular position and the huimber of 

"accesses FEE to insert p element at that position. Thus 
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When n is large, n/(n +/1) is very’ close to 1, so'that A is approximately n/2+ 1 or 
(n + 2) 2. For large wr, A.is close enough to n 2 that wë often say that the operation 


of randomly inserting an element into an ordered list requires approximately 2 node 
accesses on average. 


List Implementation of Priority Queues 


An ordered list can be used to represent a priority queue. For an ascending priority 
queue, insertion (pqinsert) is implemented by the place operation. which keeps the list 
ordered} and deletion of the minimum element (pqmindelete) is implemented by the 
pop operation, which reméves the: first element from the list. A descending priority 
queue can be implemented by keeping the list in descending, rather than ascending, 
order or by using remove to implement pgmaxdclete. A priority queue implemented as 
an ordered linked list requires examining an average of approximately .'2 nodes for 
insertion, but only one node. for deletion, — l 

An unordered list may also be:used as a priority queue. Such a list requires ex- 
amining only one node for insertion (by implementing pyinsert using push or insert) 
but always requires examining » elements for deletion (traverse the entire list to find 
the minimum or maximum and then delete that node). Thus an ordered list is somewhat 
more efficient than an unordered list in implementing a priority queue. 

The advantage of a list,over an array for implementing a priority queue is that no 
shifting of elements, or gaps are necessary in a list. An item can be inserted into a list 
without moving any other items, whereas this is impossible for an array unless extra 


space is left empty. We examine other, more efficiënt implementations of the priority 
queue in Sections 6.3 and 7.3. i ; 


Header Nodes 


Sometimes it is desirable to keep an extra node at the front of a list. Such a node 
does-not represent an item in the list and is called a header node or a list header. Thë 
. info portion of such a header node might be unused. as i!lustrated in Figure 4.2.6a. More ` 
_ often. the info portion of such a node could be used to keep global information about 
the entire list. For example, Figure 4.2.6b illustrates a list in which the info portion 
of the header node contains the number of nodes (not including the header) inthe list. In 
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such a data structure more work is needed to add or delete an item from the list, since 
the count in the header node must be adjusted. However, the number of items in the list 
may be obtained directly from the header node without traversing the entire list. 

Another example of the use of header nodes is the following. Supposé a factary 
assembles machinery out of smaller units. A particular machine (inventory number 
A746) ‘might be composed of a number of different parts (numbers 8841, K321, A087, 
J492, G593). This assembly could be represented by a list such as the one illustrated in 
Figure 4.2.6c, where each item on the list represents a component and where the header 
node repreSents the entire assembly. The empty list would no longer be represented by 
the null pointer but rather by a list with a single header node, as in Figure 4.2.6d. 

Of course, algorithms for operations such as empty, push, pop, insert, and remove 
must be rewritten to account for the presence of a header node. Most of the routines 
become a bit more complex, but some, like insert, become simpler, since an external - 
list pointer is never null. We leave the rewriting of the routines as an exercise for the , 
reader. The routines insafter and delafier need not be changed at all. In fact, when a 
header node is used, insafter and delafter can be used instead of push and pop, since 
the first item in such a list appears in the node that follows the header node, rather than 
in the first node on the list. 

If the info portion ofa node can contain a pointer, additional possibilities for the 
use of a header node present themselves. For example, the info portion of a list header ` 
might contain a pointer to the last node in the list. as in Figure 4.2.6e. Such an imple- 
mentation simplifies the representation of a queue. Until now, two external pointers, 
front and rear, were necessary for a list to represent a queue. However, now only a 
single external pointer q to the header node of the list is necessary. next(q) points to the 
front of the queue, and info(q) to its rear. 

Another possibility for the use of the info portion of a list header is as a pointer 
to a “current” node in the list during a traversal process. This would eliminate the need 
for an external pointer during traveral. 


EXERCISES 


4.2.1. Write a set of routines for implementing several stacks and queues within a single array. 

4.2.2, What are the advantages and disadvantages of representing a group of items as an array 
versus a linear linked list? f 

4.2.3. Write an algorithm to perform each of the following operations. 

ta) Append an element to the end of a list. 

(b)  Concatenate two lists. 

(c) Free all the nodes in a lisi. 

(d) Reverse a list, so that the last element becomes the first, and so on. 

te) Delete the last element from a list. 

(f) Delete the nth element trom a list. 

(g) Combine two ordered lists into a single ordered list. 

(h) Form a list containing the union of the elements of two lists. 

(i) Form a list containing the intersection of the elements of two lists. 

G) Insert an element atier the wth element of a list. 


+ 
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(k) Delete every second element from a list. 
(D) Place the elements of a list in increasing order. 
_- (m) Return the sum of the integers in a list. 
` (n) Return the number of elements in a list. 
(0) Move node(p) forward n positions in a list. ` 
(P) -Make a second copy of a list: ` ; 

4.2.4. Write algorithms to perform each of the operatioris of the previous exercise on a group 
of elements in contiguous positions of an array. 

4.2.5. What is the average number of nodes accessed in searching for aparticular element in 
an unordered list? In an ordered list? In an unordered array? In an ordered array? 

'4.2.6. Write algorithms for pginsért and pqmindelete for an ascending priority queue imple- 
` mented as an unordered list and as an ordered list. i 
` 4.2.7. Write algorithms to perform each of the operations in Exercise 4.2.3, assuming that each 
list contains a header node containing the number of elements in the list. 

4.2.8. Write an algorithm that returns a poiziter to a node containing element x in a list with 
a header node. The info field of the header should contain the pointer that traverses the 
list. 

4.2.9. Modify the C++ stack template implementation given at the end of Section 2.3 to use 
the pointer representation of stacks. 


4.3 LISTS INC 


Array implementation of Lists 


How can linear lists be represented in C? Since a list is simply a collection of 
nodes, an array of nodes immediately suggests itself. However, the nodes cannot be 
ordered by the array ordering; each must contain within itself a pointer to its successor. 
Thus a group of 500 nodes might be declared as an array node as follows: 


#define NUMNODES 500 
- Struct nodetype { 
int info, next; 
i i i 
Struct nodetype node{NUMNODES] ; 


In this scheme a pointer to a node is represented by afi array index. That is, a 
pointer is an integer between 0 and NUMNODES - 1 that references a particular ele- 
ment of the array node. The null pointer is represented by the integer — 1. Under this 
implementation, the C expression node{p] is used to reference node(p), info(p) is ref- 

` erenced by node[p].info, and next(p) is referenced by node|p].next. nuil is represented 
by —I. > , ae nhs i 3 

For'example, suppose that the variable list represents a pointer to a list. If list has 
the value 7, node{7] is the first node on the list, and node{7].info is the first data item on 
the list. The second node of the list is given by.node[7].next Suppose that node[7].next 
equals 385. Then node[385].info is the second data ifem on the list and node{385).next 

` points to the third node. P 
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The nodes of a list may be scattered throughout the array node in any arbitrary 
order. Each node carries within itself the łocation of its successor unti) the last node jn 
the list, whose next field contains — 1, which is the null pointer. There is no relation 
between the contents of a node and the pointer to it. The pointer, p, to a node merely 
specifies which element of the array node is being referenced; it is node|p].info that 
represents the information contained within that node. 

Figure 4.3.1 illustrates a portion of an.array node that contains four linked fists. 
The list list] starts at node[16] and contains the integers 3, 7, 14, 6, 5, 37, 12. The 
nodes that contain these integers in their info fields are scattered throughout the- array. 
The next field of each node contains the index within the array of the node containing 
the next element of the list. The last node on the list is node[23], which contains the 
integer 12 in its info field and the null pointer (—1) in its next field, to indicate that it is 
last on the list. ‘ 

Similarly, list2 begins at node[4} and contains the a 17 and 26, list3 begins 
at node[11] and contains the integers 31. 197 and 32, and list4 begins at node[3] and 
contains the integers 1, 18, 13, 11,4, and 15. The variables list], list2, list3, and list4 are 
integers representing external pointers to the four lists. Thus, the. fact that the variable 
list2 has the value 4 represents the fact that the list to which it points begins at node[4]. 


list3 = 


list] = 


Figure 4.3.1 Array of nodes contain- 
ing four linked lists. ` 
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Initially, all nodes are unused, since no lists have yet been formed. Therefore they 
must all be placed on the available list. If the global variable avail is used to point to 
the available list, we may initially organize that list as follows: 


avail = 0; 

for (i = 0; i°< NUMNODES-1; i++) 
node(i].next = i + 1; 

node{NUMNODES-1].next = -1; 


The 500 nodes are initially linked in their natural order, so that node{i] points to 
` node{i + 1). node[0] is the first node on the available list. node[1] is the second, and so - 
forth. node[499} is the last node on the list, since node{499].next equals — 1. There is no 
reason other than convenience for initially ordering the nodes in this fashion. We could 
just as well have set node[0].next to 499, node[499].next to 1, node{1].next to 498, and 
so forth, until node[249].next is set to 250 and node[250].next to —1. The important 
point is that the ordering is explicit within the nodes themselves and is not implied by 
some other underlying structure. 

For the remaining functions in this section, we assume that the variables node and 
avail are global and can therefore be used by any routine. 

When a node is needed for use in a particular list, it is obtained from the available 
list. Similarly, when a-node is no longer necessary, it is returned to the available list. 
These two operations are implemented by the C routines getnode and freenode. getnode 
is a function that removes a node from the available list and returns a pointer to it: 


int getnode(void) 


int p; 

s if (avail == -1) { 
printf("overflow\n"); 
exit(1); 

joa 
p = avail; 
avail = node[avail].next; - 
return(p); 
} /* end getnode */ 


If avail equals — when this function is called, there are no nodes available. This means 
that the list structures of a particular program have overflowed the available space. 


The function freenode accepts a pointer to a node and returns that node to the 
available list: 


void freenode(int p) 

{ 
node[p].next = avail; 
avail = 
return; 

} /* end freenode */ 
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The »rimitive operations for lists are straightforward C versions of the corresponding 
algorithms. The routine insafizr accepts a pointer p to a node and an item x as param- 
eters. It first ensures that p is not null and then inserts x into a node following the node 
pointed to by p: 


void insafter(int p, int x) 
{ 
int q; 
if (p == -1) { 
printf("void insertion\n"); 
return; 
} 
q = getnode(); 


node[q]. info = x; 
node[q].next = node[p].next; 
node[p].next = q; 


return; 
} /* end insafter */ 


The routine delafter(p, px), called by the statement delafter(p; &x), deletes the 
node following node(p) and stores its contents in x: 


void delafter(int p, int *px) 
{ 
int Q; 
if ((p == -1) || (node[p].next == -1)) { ' 
printf ("void deletion\n"); 
return; 
} 
q = node[p].next; 
*px = node[q]. info; 
node{p].next = node[q].next; 
freenode(q); 
return; 
} /* end-delafter */ 


Before calling insafter we must be sure that p is not null. Before calling delafter 
we must be sure that neither p nor node[p].next is null. aaa 


timitations of the Array Iniplementation 


As we saw in Section 4.2, the notion of a pointer allows us to build and manip- 
ulate linked lists of various types. The concept of a pointer introduces the possibility 
of assembling a collection of building blocks, called nodes. into flexible structures. By 
altering the values of pointers, nodes can be attached, detached. and reassembled in 
patterns that grow and shrink as execution of a program progresses. 

Under the array implementation, a fixed set of nodes represented by an array is 
established at the start of execution. A pointer to a node is represented by the relative 


ad 
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position of the node within the array. The disadvantage of that approach is twofold. 
First, the number of nodes that are needed often cannot be predicted when a program is 
written. Usually, the data with which the program is executed determines the number 
of nodes necessary. Thus no matter how many elements the array of nodes contains, it 
is always possible that the program will be executed with input that requires a larger 
number. z 

The second disadvantage of the array approach is that whatever number of nodes 
are declared must remain allocated to the program throughout its execution. For exam- 
ple, if 500 nodes of a given type are declared, the amount of storage required for those. 
500 nodes is reserved for that purpose. If the program-actually uses only 100 or even 
10 nodes in its execution the additional nodes are still reserved and their storage cannot 
be used for any other purpose. 

The solution to this problem is to allow nodes that ate dynamic, rather than static. 
That is, when a node is needed, storage is reserved for it, and when it is no longer 
needed, the storage is released. Thus the storage for nodes that are no longer in use 
is available for another purpose. Also, no predefined limit on the number of nodes is 
established. As long as sufficient storage is available to the job as a whole, part of that 
storage can be reserved for use as a node. 


Allocating and Freeing Dynamic Variables 


In Sections 1.1, 1.2, and 1.3, we examined pointers in the C language. ‘If x is any 
object, &x is a pointer to x. If p is a pointer in C, *p is the object to which p points. We can 
use C pointers to help implement dynamic linked lists. First, however, we discuss how 
storage can be allocated and freed dynamically and how dynamic storage is accessed 
in C. 

In C a pointer variable to an integer can be created by the declaration 


int *p; ‘ 
Once a variable p has been declared as a pointer to a specific type of object, it 
must be possible to dynamically create an object of that specific type and assign its 
address to p. . 

This may be done in C by calling the standard library function malloc(size). mal- 
loc dynamically allocates a portion of memory of size size and returns a pointer to an 
item of type char. Consider the declarations 


extern char *malloc(); 
., int pi; 
float *pr; 


The statements 


pi = (int *) malloc(sizeof (int)); 
pr = (float *) malloc(sizeof (float)); 
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dynamically create the integer variable *pi and the float variable *pr. These variables 
are called dynamic variables. In executing these statements, the operator sizeof returns 
the size, in bytes, of its operand. This is used to maintain machine independence. malloc 
can then create an object of that size. Thus malloc(sizeoflint)) allocates storage for 
an integer, whereas malloc(sizeof(float)) allocates storage for a floating-point number. 
malloc also returns a pointer to the storage it allocates. This pointer is to the first byte 


(for example, character) of that storage and is of type char *. To coerce this pointer so . 


that it points to an integer or real, we use the cast operator (int *) or (float *). 


(The sizeof operator returns a value of type inf, whereas the malloc function ex- | 
pects a parameter of type unsigned. To make the program “lint free” we should write . 


pi.= (int *) malloc ((unsigned) (sizeof (int))); 


However, the cast on the dai operator is often omitted.) 


As an example of the use of pointers and the function malloc, consider the fol- 
lowing statements: 


int *py “as 

2 cja x 

3 = (int *) malloc(sizeof (int)); 
4 as k 

5 q= 3 

6 sain ("%d %d \n", *p, .*q); 

7 x= 7; 

8 tq =x; 

9 printf ("%d %d \n", +p ¥*q); i 
10 p = (int *) malloc (sizeof (int)); 
ll tp = S; 

12 printf("%d %d \n", *p *q); 


In line 3, an integer variable is created and its address is placed in p. Line 4 sets the 
value of that variable to 3. Line 5 sets g to the address of that variable. The assignment 
statement in line 5 is perfectly valid, since one pointer variable (q) is being assigned 
the value of another (p). Figure 4.3.2a illustrates the situation after line 5. Note that at 
this point, *p and *g refer to the same variable. Line 6 therefore prints the contents of 
this variable (which is 3) twice. 

Line 7 sets the value of an integer variable, x, to 7 Line 8 changes the value of 
xq to the value of x. However, since p and q both point'to the same variable, *p and *q 
both, have the value 7. This is illustrated in Figure 4.3.2b. Line 9 therefore prints the 
number 7 twice. 

Line 10 creates a new integer variable and places its address in p. The results are 
illustrated.in Figure 4.3.2c. *p now refers to the newly created integer variable that has 
not yet been given a value. q has not been changed; therefore the value of *q remains 7. 
Note that *p does not refer to a single. specific variable. Its value changes as the value 
of p changes. Line 11 sets the value of this newly created variable to 5, as s illustrated i in 
Figure 4.3.2d, and line 12 prints the values 5 and 7. 
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The function free is used in C to free storage of a dynamically allocated variable. 
The statement 


Figure 4.3.2 


free(p); 


makes any future references to the variable *p illegal (unless, of course, a new value is 
assigned to p by an assignment statement or by a call to malloc). Calling free(p) makes 
the storage occupied by *p available for reuse, if necessary. 

[Note: The free function, by default, expects a pointer parameter of type char *. 
To make the statement “lint free,” we should write 


free((char *) p); 
However, in practice the cast on the parameter is often omitted.] 
To illustrate the use of the free function, consider the following statements: 


p = (int *) malloc (sizeof (int)); 
Paj 

q = (int *) malloc (sizeof (int)); 
*q = 8; 

free(p); 

p=; 


ova wne 
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7 q = (int *) malloc (sizeof (int)); 
8 r *q z= 6: 
9 l printf("%d %d \n", 4p. *q); 


The values 8 and 6 are printed. Figure 4.3.3a illustrates the situation after line 4, 
where *p qnd +q have both been allocated and given values. Figure 4,3.3b illustrates 
the effect of line 5, in which the variable to which p points has been freed. Figure 4.3.3 
illustrates line 6, in which the value of p is changed to point to the variable *g. In lines 
7 and 8, the value of q is changed to point to a newly created variable which is given 
the value 6 in line 8 (Figure 4.3.3d). 


Note that if malloc is called twice in succession and its value is assigned to the 
same variable, as in: 


p = (int *) malloc (sizeof (int)); 
èp = 3; 
p = (int *) malloc (sizeof (int)); 
*p=7; 


the first copy of *p is lost since its address was not saved. The space allocated for 
dynamic variables can be accessed only through a pointer. Unless the pointer to the . 
first variable is saved in another pointer, that variable will be lost. In fact. its storage 
cannot even be freed since there is no way to reference it in a call to free. This is an 
example of storage that is allocated but cannot be referenced. 
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The value 0 (zero) can be used ina C program as the null pointer. Any pointer 


variable may be set to this value. Usually, a standard header to a C program includes 
the definition 


#define NULL 0 


to allow the zero pointer value to be written as NULL. This NULL pointer value does 
not reference a storage location but instead denotes the pointer that does not point to 
anything. The value NULL (zero) may be assigned to any pointer variable p, after which 
a reference to *p is illegal. 

“We have noted that a call to free(p) makes a subsequent reference to *p illegal. 
However, the-actual effects of a call to free are not defined by the C language—each 
implementation of C is free to develop its own version of this function. In most C imple- 
mentations, the storage for «p is freed but the value of p is left unchanged. This means 
that although a reference to *p becomes illegal, there may be no way of detecting the il- 
legality. The value of p is a valid address and the object at that address of the proper type 
may be used as the value of *p. p is called a dangling pointer. It is the programmer's 
responsibility never to use such a pointer in a program. It is good practice to explicitly 
set p to NULL after executing free(p). 

One other dangerous feature associated with pointers should be mentioned. If p 
‘and g are pointers with the same value, the variables *p and *q are identical. Both *p and 
*q refer to the same object. Thus, an assignment to *p changes the value of *q, despite 
the fact that neither q nor *q are explicitly mentioned in the assignment statement to 

‘*p. It is the programmer's responsibility to keep track of “which pointers are pointing 
where” and to recognize the occurrence of such implicit results. 


Linked Lists Using Dynamic Variables 


Now that we have the capability of dynamically allocating and freeing a variable, 
let us see how dynamic variables can be used to implement linked lists. Recall that a 
linked list consists of a set of nodes, each of which has two fields: an information field 
and a pointer to the next node in the list. In addition, an external pointer points to the 
first node in the list. We use pointer variables to implement list pointers. Thus, we define 
the type of a pointer and a node by 


struct node { 
int info; 
struct node *next; 


K 
typedef struct node *NODEPTR; 


A node of this type is identical to the nodes of the array implementation except 
that the next field is a pointer (containing the address of the next node in the list) rather 
than an integer (containing the index within an array where the next node in the list is 
- kept). 
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Let us employ the dynamic allocation features to implement linked lists. Instead 
of declaring an array to represent an aggregate collection of nodes, nodes are allocated 
and freed as necessary. The need for a declared collection of nodes is eliminated. 

If we declare : 


NODEPTR p; 
execution of the statement 
p = getnode(); 
should place the address of an acai node into p. We present the function getnode: 


NODEPTR getnade(void) 
{ 


NODEPTR p; 
p = (NODEPTR) malloc(sizeof(struct node)); 
return(p); ee 


4 es 


Note that sizeof is applied to a structure type and returns the number of bytes required- 
for the entire structure. . A j 
Execution of the statement =~ 


freenode(p) ; 


should return the node whose address is at p to available storage. We present the routine 
freenode: 


, Void freenode(NODEPTR p) 
{ 


free(p); 


The programmer need not be concerned with managing available storage. There 
is no longer a need for the pointer avail (pointing to the first available node), since 
the system governs the allocating and freeing of nodes and the system keeps track of 
the first available node. Note also that there is no test in getnode to determine whether 
overflow has occurred. This is because such a ‘condition will be detected during the 
execution of the malloc function and is system dependent. 

Since the routines getnode and freenode are so simple under this implementation, 
they are often replaced by the in-line statements 


p = (NODEPTR) malloc(sizeof (struct node)); 
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and 


free(p); 


The procedures insafter(p,x) and delafter(p,px) are presented below using the dy- 
zamic implementation of a linked list. Assume that list is a pointer variable that points 
tv the first node of a list (if any) and equals NULL in the case of an empty list. 


void insafter(NODEPTR p, int x) 
{ : 


NODEPTR q; 

if.(p == NULL) { 
printf("void insertion\n"); 
exit(1); 


q = getnode(); 
q -> info = x; 
q -> next = p -> next; 
p->next=q; ., 
} /* end insafter */ 


void delafter(NODEPTR p, int px) i 
{ , A 
NODEPTR q; 
if ((p == NULL) || (p -> next == NULL)) { 
printf("void deletion\n"); 
exit(1); 


q = p -> next; 
*px = q -> info; 
p -> next = q -> next; 
freenode(q); 

} /* end delafter */ 


Notice the striking similarity between the preceding routines and those of the 
array implementation presented earlier in this section. Both are implementations of 
the algorithms of Section 4.2. In fact, the only difference between the two versions 
is in the manner in which nodes are referenced. 


Queues as Lists in C 


As a further illustration of how the C list implementations are used, we present C 
routines for manipulating a queue represented as a linear list. We leave the routines for 
manipulating a stack and a priority queue as exercises for the reader. For comparison 
purposes we show both the array and dynamic implemeniation. We assume that struct 
node and NODEPTR have been declared as in the foregoing. A queue is represented as 
a structure: ` 
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Array Implementation Dynamic Implementation 


struct queue { 

int front, rear; 
h }; 
struct queue q; struct queue q; 


struct queue { 
NODEPTR front, rear; 


front and rear are pointers to the first and last nodes of a queue presented as a 
list. The empty queue is represented by front and rear both equaling the null pointer. 
The function empry need check only one of these pointers since, in a nonempty queue, 
neither front nor rear will be NULL. 


int empty(struct queue *pq) int empty(struct queue *pq) 
{ 


return ((pq->front == -1) 
? TRUE: FALSE); 
} /* end empty */ 


return ((pq->front == NULL) 
? TRUE: FALSE); 
} /* end empty */ 


The routine to insert an element into a queue may be written as follows: 


void insert(struct queue *pq, int x) void insert(struct queue *pq, int x) 


{ 


int p; 
p = getnode() ; 
node[p].info = x; 
node[p].next = -1; 
if (pq->rear == -1) 
pq->front = p; 
else 
node(pq->rear].next ='p; 
pq->rear = p; 
} /* end insert */ 


The function remove deletes the first element from a queue and returns its value: 


int remove(struct queue *pq) 
{ 


int p, x; 


if (empty(pq)) { 


printf("queue underflow\n"); 


exit(1); 
} 
p = pq->front; 
x = node(p].info; 
pq->front = node(p].next; 
if (pq->front == -1) 
pg->rear = -1; 
freenode(p) ; 
return(x); 
} /* end remove */ 
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NODEPTR p; 


p = getnode() ; 

p->info = x; 

p->next = NULL; 

if- (pq->rear == NULL) 
pq->front = p; 

else ‘ 
(pq->rear)->next = 9; 

pq->rear = p; 

} /* end insert */ 


int remove(struct queue *pq) 
{ 
NODEPTR p; 
int x; 
if (empty(pq)) { 
printf(“queue underflow\n") ; 
exit(1); 


p = pq->front; 

x = p->info; 

pq->front = p->next; 

if (pq->front == NULL) 
pq->rear = NULL; 

freenode(p) ; 

` peturn(x); 
} /* end remove */ 
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Examples of List Operations in C 


Let us look at several somewhat more complex list opérations implemented in 
C. We have seen that the dynamic implementation is often superior to the array im- 
plementation. For that reason the majority of C programmers use the dynamic imple- 
mentation to implement lists. From this point on we restrict ourselves to the dynamic 
implementation of linked lists, although we might refer to the array implementation 
when appropriate. 

We have previously defined the operation place(list, x), where list points to a 
sorted linear list and x is an element to be inserted into its proper position within the 
list. Recall that this operation is used to implement the operation pqinsert to insert ii 
priority queue.. We assume that we have already implemented the stack operation pusi.. 
The code to implement the place operation follows: 


void place (NODEPTR *plist, int x) 
{ 


NODEPTR p, q; 
q = NULL; 
for (p = *plist; p != NULL & x > p->info; p = p->next) 
q= p; . 


if (q == NULL) /* insert x at the head of the list */ 
push(plist, x); 
else 
insafter(q, x); 
} /* end place */ 


Note that plist must be declared as a pointer to the list pointer, since the value 
of the external list pointer is changed if.x is inserted at the front of the list using the 
push routine. The foregoing routine would be called. by the statement 
place(&list, x);. 


As a second example, we write a function insend(plist.x) to insert the element x` 
at the end of a list list: 


void insend(NODEPTR *plist, int x) 
{ 
NODEPTR p, q; 
p = getnode(); 
p->info = x; 
p->next = NULL; &: 
if (*plist == NULL) : > 
*plist = p; 
else { 
/* search for last node */ 
for (q = *plist; q->next != NULL; q = q->next) 
Q->next = p; y 
} /* end if */ 
} /* end insend */ 
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We now present a function search(list, x) that returns a pointer to the first occurrence 
of x within the list /ist and the NULL pointer if x does not occur in the list: 


, 


NODEPTR search(NODEPTR list, int x) d 
{ . 


NODEPTR p; p 
for (p = list; p != NULL; p = p->next) 
if (p->info == x) 
return (p); 
/* x is not in the list */ 
return (NULL); 
} /*.end search */ 


The next routine deletes all nodes whose info field contains the value x: 


void remvx(NODEPTR *plist, int x) . 
{ 
NODEPTR p, q; 
int y; 
q = NULL; 
p = *plist; . 
while (p != NULL) 
if (p -> info == x) { 
p = p->next; 
if (q == NULL) { 
/* remove first node of the list */ 


freenode(*plist); 
*plist = p; 
else x F 
delafter(q, &y); 
} 
else 
/* advance to next node of list */ 
qQ =p; i ` 


p = p->next; 
} /* end if */ 
} /* end remvx */ 


Noninteger and Nonhomogeneous Lists _ 


Of course, a node on a list need not represent an integer, For example, to represent 
a stack of character strings by a linked list; nodes containing character strings in their 


info fields are needed. Such nodes using the dynamic allocation implementation could 
be declared by $ 


struct node { 


char info[100]; 
struct node *next; 
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A particular application may call for nodes containing more than one item of in- 
formation. For example, each student node in a list of students may contain the follow- 
ing information: the student's name, college identification number, address, grade point 
index, and major. Nodes for such an application may be declared as follows: 


struct node { 

char name[30); | 
char id[9]; 

char address[1007; 
float gpindex; ` 
‘char major(20); 
struct node *next; 


R 


A separate set of C routines must be written to manipulate lists containing each typ” 
node. " 


To represent nonhomogeneous lists (those that contain nodes of different types), 
a union can be used. For example, 


#define INTGR 1 
#define FLT <2 
#define STRING 3 
struct node {° 


int etype; /* etype equals INTER, FLT, or STRING ‘*/ 


/* depending on the type of the gi 
/* corresponding element. + 
union { ; 
int ival; 
float fval; 
char *pval; /* pointer to a string w/ 
} element; 
Struct node *next; 


F; 


defines a node whose items may be either integers, floating-point numbers, or strings, 
depending on the value of the corresponding etype. Sincea union is always large enough 
to hold its largest component, the sizeof and malloc fuhctions can be used to allocate 
storage for the node. Thus the functions gernode and freènode remain unchanged. Of 
course, it is the programmer's responsibility to use the components of a node as ap- 
propriate. For simplicity, in the remainder of this section we assume that a linked list 
is declared to have only homogeneous elements (so that unions are not necessary). We 


examine nonhomogeneous lists, including lists that can contain other lists and recursive 
lists, in‘ Section 9.1, - 


Comparing the Dynamic and Atray Implementations of Lists 


It is instructive to examine the advantages and disadvantages of the dynamic and 
array implementations of linked lists. The major disadvantage of the dynamic imple- 
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mentation is that it may be more time-consuming to call upon the system to allocate 
and free storage than to manipulate a programmer-managed available list. Its majar 
advantage is that a set of nodes is not reserved in advance. for use by a particular group 
of lists. rs 

For example, suppose that a Program uses two types of lists: lists of integers and 
lists of characters. Under the array representation, two arrays of fixed size would im- 
mediately be allocated. If one group of lists overflows its array, the program cannot 
continue. Under the dynamic representation, two node types are defined at the outset, 
but no storage is allocated for variables until needed. As nodes are needed, the system 
is called upon to provide them. Any storage not used for one type of node may be used ° 
for another. Thus as long as sufficient storage is available for the nodes actually present 
in the lists, no overflow occurs. 

Another advantage of the dynamic implementation is that a reference to *p does 
not involve the address computation that is necessary in computing the address of 
node[p}. To compute the address of node[p], the contents of p must be added to the 
base address of the array node, whereas the address of xp is sivé by the contents of p 
directly. a” 


` 


Implementing Header Nodes 


At the end of Section 4.2 we introduced the concept of header nodes that can 
contain global information about a list, such as its length or a pointer to the current or 
last node on the list. When the data type of the header contents is identical. to the type 
of the list-node‘contents, the header can be implemented simply as just another node at 
the beginning of the list. l 

It is also possible for header nodes to be declared as väriables separate from the 
set of list nodes. This is particularly useful when the header contains information of 


a different type than the data in list nodes, For exampie. consider the following set of 
declarations: : 


struct node { 
char info; 
Struct neie *next; 


Pa 

Struct charstr { 

int length; 

struct node “firstchar; 
J s 


struct charstr sl, s2; 


The variables s1 and s2 of type charstr are header nodes for a list of characters. The 
header contains the number of characters in the list (length) and a pointer to the list 
(firstchar). Thus, s! and 52 represent varying-length character strings. As exercises, 


you may wish to write routines to concatenate two such character strings or to extract a 
substring from such a string. 
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EXERCISES 


43.1. 


4.3.2. 


4.3.3. 


4.3.4. 


4.3.5. 
4.3.6. 


4.3.7. 


4.3.8. 


4.3.9. 


4.3.10. 


Implement the routines empty, push, pop, and popandtest using the array and the dy- 
namic storage implementations of a linked stack. 


Implement the routines empty, insert, and remove using a dynamic storage implemen- 
tation of a linked queue. 


Implement the routines empty, pqinsert, and pqmindelete using a dynamic jig in- 
plementation of a linked priority queue. 


Write C routines using both the array and dynamic variable implementations of a linked 
list to implement the operations of Exercise 4.2.3. 


Write a C routine to interchange the mth and nth elem: x of a list. 


Write a routine inssub(/1, il, 12, i2, len) to insert ihe elements of list /2 beginning ~ 
the i2th element and continuing for len elements into the list /1 beginning at position 
i1. No elements of the list /1 are to be removed or replaced. If i1 > lenerh(I1) + 1 
(where length(/1) denotes the number of nodes in the list /1). or if i2 

length(!2), or if il <1, or if i2 < 1, print an error mess: `e. The list 12 snow roi 
unchanged. 


Write aC function search(/, x) that accepts a pointer / to a list of integers and an intecer x 
and returns a pointer to a node containing x, if it exists, and the null pointer otherwise. 
Write another function, srchinsrt(1, x), that adds x to l if it is not found and always 
returns a pointer to a node containing x. 


Write a C program to read a group of input lines, each containing one word. Pririt each 
word that appears in the input and the number of times that it appears. 


Suppose that a character string is represented by a list of single characters. Write a set of 

routines to manipulate such lists as follows (in the following, /], /2. and list are pointers 

to a header node of a list representing a character string, str is an array of characters, 
and i] and i2 are integers): 

(a) strcnvcl(str) to convert the character string szr to a list. This function returns a 
pointer to a header node. 

(b)  strenvic(list, str) to convert a list into a character string. 

(€) _ strpsi(l\, 12) to perform the strpos function of Section 1.2 on two character strings 
represented by lists. This function returns an integer. 

(d) strvrfyl(I1, 12) to determine the first positicn, of the string represented by /1 that 
is not contained in the string represented by /2. This function returns an integer. 

(e)  strsbstr(1\, il, i2) to perform the substr function of Section 1.2 on a character 
string represented by list /1 and integers | l and i2. This function returns a pointer 
to the header node of a list representing a character string, which is the desired 
substring. The list /1- remains unchanged. 

(f) strpsbl(ll, il, i2, 12) to perform a pseudo-substr assignment to list /1. The ele- 
ments of list /2 should replace the i2 elements of /] beginning at position il, The 
list 12 should remain unchanged. 

(g)  strempi(11, 12) to compare two character strings represented by lists. This func- 
tion returns —1 if the character string represented by JI is less than the string 


represented by /2, 0 if they are equal, and | if the string represented by /1 is 
greater. 


Write a function binsrch that diii two parameters, an array of pointers to a group 
of sorted numbers, and a single number. The function should use a binary search (see 
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Section 3.!) to return a pointer to the single number if it is in the group. If the number 
is not present in the group, return the value NULL. 


4.3.11. Assume that we wish to form N lists, where N is a constant. Declare an array list of 


pointers by 
#define N... 
Struct node { 
int info 
struct node *next 
te 


typedef struct node *NODEPTR; 
NODEPTR list [N]; 


Read two numbers from each input line, the first number being the index of the list into 
which the second number is to be placed in ascending order. When there are no more * 
input lines, print all the lists. 


4.4 EXAMPLE: SIMULATION USING LINKED LISTS 


One of the most useful applications of queues, priority queues, and linked, lists is in 
simulation. A simulation program attempts to model a real-world situation in order to 
learn something about it. Each object and action in the real situation has its counterpart 
in the program. If the simulation is accurate—that is, if the program successtully mirrors 
the real world—the result of the program should mirror the result of the actions being 
simulated. Thus it is possible to understand what occurs in the real-world situation 
without actually observing its occurrence. 

Let us look at an example. Suppose that there is a bank with four tellers. A cus- 
tomer enters the bank at a specific time (71) desiring to conduct a transaction with any 
teller. The transaction may be expected to take a certain period of time (12) before it is 
completed. If a teller is free, the teller can process the customer’s transaction immedi- 
ately, and the customer leaves the bank as soon as the transaction is completed. at time 
t1 + 12. The total time spent in the bank by the customer is‘exactly equal to the duration 
of the transaction (12). 

However, it is possible that none of the tellers are free; they are all servicing 
customers who arrived previously. In that case there is a line waiting at each teller’s 
window. The line for a particular teller may consist of a single person—the one cur- 
rently transacting business with the teller—or it may be a very long line. The customer 
proceeds to the back of the shortest line and waits until all the previous customers have 
completed their transactions and have left the bank. At that time the customer may 
trafisact his or her business. The customer leaves the bank at 72 time units after reach- 
ing the front of a teller’s line. In this case the time spent in the bank is 72 plus the time 
spent waiting on line. ; : 

Given such a system, we would like to compute the average time spent by a cus- 
tomer in the bank. One way of doing so is to stand in the bank doorway, ask departing 
customers the time of their arrival and recerd the time of their departure, subtract the 
first from the second for each customer. and take the average over all customers. How- 
ever, this would not be very practical. lt would be difficult to ensure that no customer 
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is overlooked leaving the bank. Furthermore, it is doubtful that most Customers would 
remember the exact time of arrival. 

Instead, we write a program to simulate the customer actions. Each part of the 
real-world situation has its analogue in the program. The real-world action of a customer 
arriving is modeled by input of data. As each customer arrives gwo facts are known: the 
time of arrival and the duration of the transaction (since, presumably, when a customer 
arrives, he or she knows what he or she wishes to do at the bank). Thus the input data 
‘for each customer consists of a pair of numbers: the time (in minutes Since the bank 
opened) of the customer's arrival and the amount of time (again, in minutes) necessary 
for the transaction. The data pairs are ordered by increasing arrival time. We assume at 
least one input line. i 

The four lines in the bank arc represented by four queues. Each node of the queues 
represents a customer waiting on a line, and the node at the front of a queue represents 
the customer currently being serviced by a teller. 

Suppose that at a given instant of time the four lines each contain a specific num- 
ber of customers. What can happen to alter the status of the lines? Either a new customer 
enters the bank, in which case one of the lines will have an additional customer, or the 
first customer on one of the four lines completes a transaction, in which case that line 
‘will have one fewer customer. Thus there are a total of five actions (& customer entering 


plus four cases of a customer leaving) that can change the status of the lines. Each of 
these. five actions is called an event. 


Simulation Process 


The simulation proceeds by finding the next event to occur and effecting the 
change in the queues that mirrors the change in the lines at the bank due to that event. 
To keep track of events, the program uses an ascending priority queue, called the event 
list. This list contains at most five nodes, each representing the next occurrence of one 
of the five types of events. Thus the event list contains one node representing the next 
customer arriving and four nodes representing each of the four customers at the head 
of a line completing a transaction and leaving the bank. Of course, it is possible that 
one or more of the lines in the bank are empty, or that the doors of the bank have been 
closed for the day, so that no more customers are arriving. In such’ cases the event list 
contains fewer than five nodes. l 

An event node representing a customer’s arrival is called an arrival node, and a 
node representing a departure is called a departure node. At each point in the simu- 
lation, it is necessary to know the next event to occur. For this reason, the event list is 
ordered by increasing time of event occurrence, so that the first event node on the list 
represents the next event to occur. Thus the event list is an ascending priority queue 
represented by an ordered linked list. 

The first event to occur is the arrival of the first customer. The event list is therefore 
initialized by reading the first input line and placing an arrival node representing the first 
Customer's arrival on the event list. Initially, of course. all four teller queues are empty. 
The simulation then proceeds as follows: The first node on the event list is removed and 
the changes that the event causes are made to the queues, As we shall soon see, these 
changes may also cause additional events to be placed on the event list. The process of 
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removing the first node from the event list and effecting the changes that it causes is 
repeated until the event list is empty. 

When an arrival node is removed from the event list, a node representing the 
arriving customer is placed on the shortest of the four teller queues. If that customer is 
the only one on a queue, a node representing his or her departure is also placed on the 
event list, since he or she is at the front of the queue. At the same time, the next input 
line is read and an arrival node representing the next customer to arrive is placed on 
the event list. There will always be exactly one arrival node on the event list (as long 
as the input is not exhausted, at which point no more customers arrive), since as soon 
as one arrival node is removed from the event list another is added to it. 

When a departure node is removed from the cvent list, the node representing the 
departing customer is removed from the front of one of the four queues. At that point 
the amount of time that the departing customer has spent-in the bank is computed and 
added to a total. At the end of the simulation, this total will be divided by the number of 
customers to yield the average time spent by a customer, After a customer node has been 
deleted from the front of its queue, the next customer on the queue (if any) becomes the 
one being serviced by that teller and a departure node for that next customer is added 
to the évent list. 

This process continues until the event list is empty, at which point the average 
time is computed and printed. Note that the event list itself does not mirror any part of 
the real-world situation. It is used as part of the program to control the entire process. 
A simulation such as this one, which proceeds by changing the simulated situation in 
response to the occurrence of one of several events. is called an event-driven simula- 
tion. 


Data Structures | 
We now examine the data structures necessary for this program. The Hodes on the 
queues represent customers and therefore must contain fields representing the arrival 
time and the transaction duration, in addition to a next field to link the nodes in a list. 
The nodes on the event list represent events and therefore must contain the time that 
the event occurs, the type of the event. and any other information associated with the 
event, as well as a next field. Thus it would seem that two separate node pools are 
needed for the two different types of node. Two different types of node would entail 
two getnode and freenode routines and two sets of list manipulation routines. To avoid 
this cumbersome set of duplicate routines, let us try to use a single type of node for both 
events and customers. | 


We.can declare such a pool of nodes and a pointer type as follows: 


| 
> 


struct node { 
int- time; 
int duration; 
int type; 
struct node *next; 

}; 

typedef struct node *NODEPTR; 


Queues and Lists Chap. 4 
222 


In a customer node, time is the customer’s arrival time and duration is the trans- 
action’s duration. type is unused in a customer node. next is used as a pointer to link 
the queue together. For an event node, time is used to hold the time of the event's oc- 
currence; duration is used for the transaction duration of the arriving customer in an 
arrival node and is unused in a departure node. type is an integer between —1 and 3, 
depending on whether the event is an arrival (type == —1) ora departure from line 0, 
1, 2, or 3.(type == 0, 1, 2, or 3). next holds a pointer linking the event list together. 


The four queues representing the teller lines are declared as an array by the dec- 
laration <a 5 


struct queue  { 
NODEPTR front, rear; 
int num; 

yi 

struct queue q[4]; 


The variable g[i] represents a header for the ith’teller queue. The num field of a queue 
contains the number of customers on that queue. 3 f 

A variable evlist points to the front of the event list. A variable rottime is used 
to keep track of the total time spent by all customers. and count keeps count of the 
number of customers that have passed through the bank. These will be used at the end 
of the simulation to compute the average time spent in the bank by the.customers. An 


auxiliary variable auxinfo is used to store temporarily the information portion of a node. 
These variables are declared by 


NODEPTR evlist; 
float count,. tottime; 
Struct node auxinfo; 


Simulation Program 


The main routine initializes all lists and queues and repeatedly removes the next 
node from the event list to drive the simulation'until the event Jist is empty. The event list 
is ordered by increasing value of the time field. The program uses the call place(&evlist, 
&auxinfo) to insert a node whose information is given by. auxinfo in its proper place 
in the event list. The main routine also calls popsub(&evlist, &auxinfo) to remove the 
first node from the event list and place its information in auxinfo. This routine is equiv- 
alent to the function pop. These routines must, of course, be suitably modified from the 
examples given in the last section in order to handle this particular type of node. Note 
that evlist, place, and popsub are merely a particular implementation of an ascending 
priority queue and the operations pginsert and pgmindelete. A more efficient represen- 

‘tation of a priority queue (such as we present in Sections 6.3 and 7.3) would allow the 
program to operate somewhat more efficiently. 

The main program also calls on the functions arrive and depart, which effect the 
changes in the event list and the queues caused by an arrival and a departure. Specif- 
ically, the function arrive(atime, dur) reflects the arrival of a customer at time atime 
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with a transaction of duration dur, and the function depart(gindx, dtime) reflects the 


departure of the first customer from queue g[gindx] at time dtime. The codin 


routines will be given shortly. 
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#include <stdio.h>- 
#define NULL 0 
struct node{ 
int duration, time, type; 
Struct node *next; 
Js 
typedef struct node *NODEPTR; 
struct queue { 
NODEPTR front, rear; 
int num; 


J; 


_ Struct queue q[4]; 


struct node auxinfo; 

NODEPTR evlist; 

int atime, dtime, dur, gindx; 
float count, tottime; 


void place(NODEPTR *, struct node +); 

void popsub(NODEPTR *, struct node *); 

void arrive(int, int); 

void depart(int, int); 

void push(NODEPTR *, struct node *); 

void insafter(NODEPTR *, struct node *); 
int empty(NODEPTR); 

void insert(struct queue *, struct node *); 
void remove(struct queve *, struct node +); 
NODEPTR getnode(void); 

void freenode(NODEPTR) ; 


void main() 
{ 
/* initializations */ 
evlist = NULL; 
count = 0; 
tottime = 0; 
for (qindx = 0; qindx < 4; qindx++) { 
q(qindx].num = 0; 
ql{qindx).front = NULL; 
glqindx}.rear = NULL; 
} /* end for */ 
/* initialize the event list with the first arrival */ 
printf("enter time and duration\n"); : 
scanf("%d %d", &auxinfo.time, &auxinfo.duration); 
auxinfo.type = -1;  /* an arrival */ 
place(&evlist, &auxinfo); 
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/* run the simulation as Jong as the event list’ is ‘not empty */ 
while (evlist != NULL) { 
Popsub(&evlist, &auxinfo); 
/* check if the next event is an arrival or departure */ 
if (auxinfo.type == -1) { 
“i an arrival */ 
atime = auxinfo, time; 
dur = auxinfo.duration; 
arrive(atime, dur); 


else { 
/* a departure */ 
qindx = auxinfo. type; 
dtime = auxinfo.time; 
depart(qindx, dtime); 
} /* end if */ 
} /* end while */ 
printf(“average time is %4.2f", tobtine/count); 
} /* end main */ 


The routine arrive(atime, dur) modifies the queues and the event list to reflect á 
new arrival at time atime with a transaction of duration dur. It inserts a new customer 
node at the rear of the shortest queue by calling the function insert(&q{j], &auxinfo). 
The insert routine must be suitably modified to handle the type of node in this example 
and must also increase g[j].num by 1. If the customer is the only one on the queue, a 
node representing his or her departure is added to the event list by calling on the function 
place(&eviist, &auxinfo). Then the next data pair (if any) is read and an arrival node 
is placed on the event list to replace the arrival that has just been processed. If there is 
no more input, the function returns without adding a new amival node and the program 
processes the remaining (departure) nodes on the event list. 


~ 


void arrive(int atime, int dur) 
{ 
int i, j, small; 
. f* find the shortest queue */ 
j= 0; 
small = q[0] .num; 
for (i = 1; i < 4; i++) 
if (qali].num < small) { 
smal] = q[i].num; 
j=i; 
} /* end for... if */ 
/* Queue j is the shortest. Insert à new customer node. */ 
auxinfo.time = atime; 
auxinfo.duration = dur; 
auxinfo.type = j; 
insert(&q[{j], &auxinfo); 
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/* Check if this is the only node on the queue. If it */ 
/* is, the customer's departure node must be placed on */ 
gs the event list. LF | 
if (q[j].num == 1) { 

auxinfo.time = atime + dur; 

place(&evlist, &auxinfo); 


/* If any input remains, read the next data pair and */ 

/* place an arrival on the event list. */ 

printf("enter time\n”); 

if (scanf("xd", &auxinfo.time) != EOF) { 
printf("enter duration\n"); 
scanf("%d", &auxinfo. duration); P arm 
auxinfo.type = -1; l / 
place(&evlist, &auxinfo); 

} /* end if */ . á 

} 7* end arrive */ / 


~/ 


The routine depart(gindx, dtime) modifies the queue qigindx] and the event list 
to reflect the departure of the first customer on the queue at time dtime: The cus- 
tomer is removed from the queue by the call remove(&aq[gindx], &auxinfo), which 
must be suitably modified to handle the type of node in this example and must &!so 
decrement the queue’s num field by 1. The departure node of the next customer on 
the queue (if any) replaces the departure node that has just been removed from the 
event list. 


void depart(int qindx, int dtime) 


NODEPTR p; 
remove (&q[qindx] ,° &auxinfo); 
tottime = tottime + (dtime - auxinfo. time); 
count++; 
./* if there are any more customers on the Queue, */ 
_/* place the departure of the next customer onto */ 
/* the event list after computing its departure time */ 
if (q{qindx].num>0) { 
P = q{qindx). front; 
auxinfo.time = dtime + p->duration; 
auxinfo.type = qindx; 
place(&evlist, &auxinfo); 
} /* end if */ 
} /* end depart */ 


Simulation programs are rich in their use of list structures. The reader is urged 
to explore the use of C for simulation and the use of special-purpose simulation 
languages., ‘ 
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4.4.1. 


4.4.4. 


4.4.5. 


4.4.6. 


4.4.7. 


4.4.8. 


In the bank simulation program of the text, a departure node on the event list represents 
the same customer as the first node on a customer queue. Is it possible to use a single 
node for a customer currently being serviced? Rewrite the program of the text so that 
only a single node is used. Is there any advantage to using two nodes? 

The program in the text uses the same type of node for both customer and event nodes. 
Rewrite the program using two different types of nodes for these two purposes. Does 
this save space? 


Revise the bank simulation program of the text to determine the average length of the 

four lines. : 

Modify the bank simulation program to compute the standard deviation of the time spent” 
by a customer in the bank. Write another program that simulates a single line for all four 

tellers with the customer at the head of the single line going to the next available teller. 

Compare the means and standard deviations of the two methods. 

Modify the bank simulation program so that whenever the length of one line exceeds 


the length of another by more than two, the last customer on the longer line moves to the 
rear of the shorter. 


Write a C program to simulate a simple multiuser computer system as follows: Each 
user has a unique ID and wishes to perform a number of transactions on the computer. 
However, only one transaction may be Processed by the computer at any given moment. 
Each input line represents a sing!e user and contains the user’s ID followed by a starting 
time and a series of integers representing the duration of each of his or her transactions. 
The input is sorted by increasing starting time, and all times and durations are in seconds. 
Assume that a user does not request time for a transaction until the previous transaction 
is complete and that the computer accepts transactions on a first-come, first-served basis. 
The program should simulate the system 2nd print a message containing the user ID and 
the time whenever a transaction begins and ends. At the end of the simulation it should 
print the average waiting time for a transaction. (The waiting time is the amount of time 
between the time that the transaction was requested and the time it was started.) 

What parts of the bank simulation program would have to be modified if the priority 
queue of events were implemented as an array or as an unordered list? How would they 
be modified? 

Many simulations do not simulate events given by input data but rather generate events 
according to some probability distribution. The following exercises explain how. Most 
computer installations have a random number generating function rand(x). (The name 
and parameters of the function vary from System to system. rand is used as an example 
only.) x is initialized to a value called a seed. The statement x = rand(x) resets the value 
of the variable x to a uniform random real number between 0 and 1. By this we mean 
that if the statement is executed a sufficient number of times and any two equal-length 
intervals between O and | are chosen, approximately as many of the successive values of 
x fall into one interval as into the other. Thus the probability of a value of x falling in an 
interval of length / <= | equals /. Find out the name of the random number generating 
function on your system and verify that the foregoing is true. Given a random number 
generator rand consider the following statements: 


x = rand(x); 
y = (b-a)*x + a 
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4.4.9, 


(a) 


(b) 


Show that, given any two equal-length intervals within the interval from ato b, 
if the statements are repeated sufficiently often, an approximately -equal number 
of successive values of y fall into each of the two intervals. Show that if a and b 
are integers, the successive values of y truncated to an integer equal each integer 
between a and b — | an approximately equal number of times. The variable yis 
said to be a uniformly distributed random variable. What is the average of the 
values of y in terms of a and b? 

Rewrite the bank simulation of the text, assuming that the transaction duration: 
is uniformly distributed between 1 and 15. Each data pair represents an arriving 
customer and contains only the time of arrival. Upon reading an input line, generate 
a transaction duration for that customer by computing the next value according to 
the method just outlined. 


The successive values of y generated by the following statements are called normally 
distributed. (Actually, they are approximately normally distributed, but the approxima- 


tion is close enough.) 


(a) 
(b) 


, 


float x[15]; 

float m, s, sum, y; 

int i; 

/* statements initializing the values of s, m and */ 

"/* the array x go here ah fa 

while ( /* a terminating condition goes here */ ) { 
sum = 0; 


for (i = 0; i< 15; i++) { 
x[i] = rand(x[i]); 
sum = sum + x[i]; 
} /* end for */ 
y=s * (sum - 7.5) / sqrt(1.25) + m; 
/* statments that use the value of y go here */ 
} /* end while */ 


Verify that the average of the values of y (the mean of the distribution) equals m 
and that the standard deviation equals s. 

A certain factory produces items according to the following process: an item must 
be assembled and polished. Assembly time is uniformly distributed between 100 
and 300 seconds, and polishing time is normally distributed with a mean of 20 
seconds and a standard deviation of 7 secortts (but values below 5 are discarded). 
After an item is assembled, a polishing machine must be used, and a worker cannot 
begin assembling the next item until the item he or she has just assembled has been 
polished. There are ten workers byt only one polishing machine. If the machine 
is not available, workers who have finished assembling their items must wait for 
it. Compute the average waiting time per item by means of a simtilation. Do the 


same under the assumption of two and three polishing machines. 


4.5 OTHER LIST STRUCTURES 


Although a linked linear list is a useful data structure, it has several shortcomings. In 
this section we present other methods of organizing a list and show how they can be 
used to overcome these shortcomings. 
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Figure 4.5.1 Circular list. 
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Figure 4.5.2 First and last nodes of a circujar list. 


Circular Lists 


Given a pointer p to a node in a linear list, we cannot reach any of.the nodes that 
precede node(p). If a list is traversed, the external pointer to the list must be preserved 
to be able to reference the list again. 

Suppose that a small change is made to the structure of a linear list, so that the 
next field in the last node contains a pointer back to the first node rather than the null 
pointer. Such a list is called a circular list and is illustrated in Figure 4:5.1.From any 
point in such a list it is possible to reach any other point in the list. If we begin at a 
given node and traverse the entire list, we ultimately end up at the starting point. 

Note that a circular list does not have a natural “first” or “last” node. We must, 
therefore, establish a first and last node by convention. One useful convention is to let 
the external pointer to the circular list point to the last node, and to allow the following 
node to be the first node, as illustrated in Figure 4.5.2. If p is an external pointer to 
a circular list, this convention allows access to the last node of the list by referencing 
node(p) and to the first node of the list by referencing node(next(p)). This convention 
provides the advantage of being able to add or remove an element conveniently from 
either the front or-the rear of a list. We also establish the convention that a null pointer 
represents an empty circular list. 


Stack as a Circular List A 


A circular list can be used to represent a stack or a queue. Let stack be a pointer 
to the last node of a circular list and let us adopt. the convention that the first node is 
the top of the stack. An empty stack is represented by a null list. The following is a C 
function to determine whether the-stack is empty. It is called by empry(&stack). 


int empty(NODEPTR *pstack) 
{ 


return ((*pstack == NULL) ? TRUE : FALSE); 
} /* end empty */ 
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The following is aC function to push an integer x onto the stack, The push function 
calls on the function empry, which tests whether its parameter is NULL. It is called by 
push(&stack, x), where stack is a pointer to a circular list acting as a stack. 


nik push(NODEPTR *pstack, int x) 


NODEPTR p; 

p = getnode(); 

p->info = x; 

if (empty(pstack) == TRUE) 
*pstack = p; 

else 
p->next = (*pstack) -> next; 

(*pstack) -> next = p; 

} /* end push */ 


Note that the push routine is slightly more complex for circular lists than itis for linear ` 
lists. ° 

The C pop function for a stack implemented as a circular list calls the function 
freenode introduced earlier. pop is called by pop(&stack), ; 


int pop(NODEPTR *pstack) 
{ 


int x; 

NODEPTR p; . 

if (empty(pstack) == TRUE) { 
printf("stack underflow\n"); 
exit(1); 

} /* end if */ 

p = (*pstack) -> next; 

X = p->info; 

if (p == *pstack) 
/* only one node on the stack */ 
*pstack = NULL; 

else 
(*pstack) -> next = p->next; 

freenode(p); 

return(x); 

} /* end pop */ 


> 


Queue as a Circular List 


It is easier to represent a queue as a circular list than as a linear list. As a linear 
list, a queue is specified by two pointers, one to the front of the list and the other to its . 
rear. However, by using a circular list, a queue may be specified by a single pointer q 
to that list. node(q) is the rear of the queue and the following node is its front. 

The function empry is the same as for Stacks. The routine remove(pq) called by 
remove(&q) is identical to pop except that all references to pstack are replaced by pq, 
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a pointer to q: ‘Ihe C routine insert is called by the statement insert(&q, x) and may be 
coded as follows: 


void insert(NODEPTR *pq, int x) 
{ 


NODEPTR p; s 
p = getnode(); 
p->info = x; ; 
if (empty(pq) == TRUE) 
*PA = p; 
else 
| “pronext = (*pq) -> next; 
(*pq) -> next = p; 
*pa = p; 
return; ; 
} /* end insert */ 


Note that insert(&q, x) is equivalent to the code 


` push(&q, x); 
q = q->next; 


That is, to insert an element into the rear of a circular queue, the elemnt.is inserted 


into the front of the queue and the circular list pointer is then advanced one element, so 
that the new element becomes the rear. i 


Primitive Operations on Circular Lists 


The routine insafter(p, x), which inserts a node containing x after node(p), is sim- 
ilar to the corresponding routine for linear lists as presented in Section 4.3. However, 
the routine delafter(p, x) must be modified slightly. Looking at the corresponding rou- . 
tine for linear lists as presented in Section 4.3, we note one additional consideration in 
the case of a circular list. Suppose that P points to the only node in the list. In a linear 
list, next(p) is null in that case, making the deletion invalid. In the case of a circular 
list, however, next(p) points to node(p), so that node(p) follows itself. The question is 
whether or not it is desirable to delete node(p) from the list in this case. It is unlikely that 
we would want to do so, since the operation delafter is usually invoked when pointers 
to each of two nodes are given, one immediately following another, and it is desired to 


delete the second. delafter for circular lists using the dynamic node implementation is 
implemented as follows: n! 


void delafter(NODEPTR p, int *px) 
{ 


NODEPTR q; 

if ((p == NULL) || (p == p->next)) { 
/* the list is empty or contains only a single node */ 
printf("void deletion\n"); 
return; 

} /* end if */ 
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q = p->next; 
*px = q->info; 
P->next = q->next; 
freenode(q); 
return; 

} /* end delafter */ 


Note, however, that insafter cannot be used to insert a node following the last 
node in a circular list and that delafter cannot be used to delete the last node of a cir- 
cular list. In both cases the external pointer to the list must be modified to point to the 
new last node. The routines can be modified to accept list as an additional parameter 
anu to change its value when necessary. (The actual parameter in the calling routine 
would have to be &/ist, since its value is changed.) An alternative is to write separate 
routines insend and dellast for these cases. (insend is identical to the insert operation 
for a queue implemented as a circular list.) The calling routine would be responsible 
for determining which routine to calh. Another alternative is to give the calling rou- 
tine the responsibility of adjusting the external pointer /ist if necessary. We leave the 
exploration of these possibilities to the reader. 

If we are managing our own available list of nodes (as for example under the 
array implementation), it is also easier to free an entire circular list than to free a linear 
list. In the case of a linear list the entire list must be traversed, as one node at a time 
is returned to the available list. For a circular list. we can write a routine freelist that 
effectively frees an entire list by simply rearranging pointers. This is left as an exercise 
for the reader. 

Similarly, we may write a routine concar(&listl, &list2) that concatenates two 
lists; that is, it appends the circular list pointed to by list2 to the end of the circular list 
pointed to by /ist1. Using circular lists, this can be done without traversing either list: 


void concat(NODEPTR *plistl, NODEPTR *plist2) 
{ 


NODEPTR p; 

if (*plist2 == NULL) 
return; bi: on 

if (*plistl == NULL) { 
*plistl =.*plist2; 
return; 

} 

p = (*plistl) -> next; 

(*plistl) -> next = (*plist2) -> next; 

(*plist2) -> next = p; 

*plistl = *plist2; 

return; 

} /* end concat */ 


: The Josephus Problem 


Let us consider a problem that can be solved in a straightforward manner by us- 
ing a circular list. The problem is known as the Josephus problem and postulates a group 
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of soldiers surrounded by an overwhelming enemy force. There is no hope for victory 

without reinforcements, but there is only a single horse available for escape. The sol- 

diers agree to a pact to determine which of them is to escape and summon help. They 

form a circle and a number n is picked from a hat. One of their names is also picked from 

_ ahat. Beginning with the soldier whose name is picked, they begin to count clockwise’ 
around the circle. When the count reaches n, that soldier is removed from the circle, and 

the count begins again with the next soldier. The process continues so that each time 

the count reaches n, another soldier is removed from the circle. Any soldier removed 

from the circle is no longer counted. The last soldier remaining is to take the horse and . 
escape. The problem is, given a number n, the ordering of tlie soldiers inthe circle, and, 
the soldier from whom the count begins, to determine the order in which’ soldiers are 

eliminated from the circle and which soldier escapes. 

The input to the program is the number n and a list of names; which is the clock- 
wise ordering of the circle, beginning with the soldier from whom the count is to start. 
The last input line contains the string “end” indicating the’end of the input. The pro- 
gram should print the names in the order that they are eliminated and the name of the 
soldier who escapes. i i 

For example, suppose that n = 3 and that there are five soldiers named A, B, C, 
D, and E. We count three soldiers starting at A, so that C is eliminated first. We then 
begin at D and count D, E, and back to A, so that A is eliminated next. Then we count 
B, D, and E (C has already been eliminated), and finally B, D, and-B, so that D is the 
one who escapes. 

Clearly, a circular list in which each node represents one soldier is a natural data 
structure to use in solving this problem. It is possible to reach any node from any other 
by counting around the circle. To represent the removal of a soldier from the circle, a 
node is deleted from the circular list. Finally, when only one node remains on the list, 
the result is determined. 

An outline of the program might be as follows: 


‘read(n); 

read(name) ; 

while (name != END) { 
insert name on the circular list; 
read(name) ; 

} /* end while */ 

while (there is more than one node on the list) { 
count through n- 1 nodes on the list; 
print the name in the nth node; 
delete the nth node; 

} /* end while */ 

print the name of the only node on the list; 


We assume that a set of nodes has been declared as before except that the info field 
holds a character string (an array of characters) rather than an integer. We also assume at 
least one name in the input. The program uses the routines insert, delafter, and freenode. 
The routines insert and delafter must be modified, since the information portion of the 
node is a character string. Assignment from one character string variable to another is 
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accomplished via a loop. The program also makes use of a function egstr(str!, str2), 


which returns TRUE if strl is-tdentical to str2, and FALSE otherwise. The coding of 
this routine is left to. the reader. i 


void josephus(void) 
{ 


char tend = "end"; 
char name[MAXLEN] ; 
int i, n; 
NODEPTR list = NULL; 
printf("enter n\n"); 
scanf("%d", &n); 
/* read the names, placing each */ 
/* at the rear of the list */ 
printf(“enter names\n"); 
scanf("%s", &name); 
/* form the list */ 
while (!eqstr(name, end)) { 
`e insert(&list, name); 
scanf("%s", name); 
} /* end while */ me ie 
printf("the order in which the soldiers are eliminated is:\n"); 
/* continue counting as long as more */ í 
/* than one node remains on the list */ 
while (list != list->next) { 
for (i = 1; i < n; i++) 
list = list->next; 
/* list->next points to the nth node */ 
delafter(list, name); 
printf("%s\n", name); 
} /* end while */ 
/* print the only name on the list and free its node */ 
printf("the soldier who escapes is: %s", list->info); 
freenode(list); 
} /* end josephus */ Ps 


Header Nodes 


Suppose that we wish to traverse a circular list. This can be done by repeatedly 
executing p = p— next, where p is initially a pointer to the beginning of the list. How- 
ever, since the list is circular, we will not know when the entire list has been traversed 
unless another pointer. list. points to the first node and a test is made for the condition 
p == list. | 

An alternative method is to place a header-node as the first node of a circular list. 
This list header may be recognized by a special value in its info field that cannot be 
the valid contents of a list node in the context of the problem. or it may contain a flag 
marking it as a header, The list can then be traversed using a single pointer, with the 
traversal halting when the header node is reached. The external pointer to the list is to 
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Figure 4.5.3 Circular list with a header node. 


its header node, as illustrated in Figure 4.5.3. This means that a node canmot easily be’ 
added onto the-rear of such a circular list, as could be done when the external pointer 
was to the last node of the list. Of course, it is possible to keep a pointer to the last node 
of a circular list even when a header node is being used. 

If a stationary external pointer to a circular list is used in addition to the pointer 
used for traversal, the header node need not contain a special code but can be used 
in much the same way as a header node of a linear list to contain global information 
about the list. The end of a traversal would be signaled by the equality of the traversing 
pointer and the external stationary pointer. 


Addition of Long Positive integers Using Circular Lists 


We now present an application of circular lists with header nodes. The hardware 
of most computers allows integers of only a specific maximum length. Suppose that 
we wish to represent positive integers of arbitrary length and to write a function that 
returns the sum of two such integers. 

To add two such long integers, their digits are traversed from right to left, and 
corresponding digits and a possible carry from the previous digits’ sum are added. This 
Suggests representing long integers by storing their digits from right to left in a list 
so that the first node on the list contains the least significant digit (rightmost) and the 
last node contains the most significant ( leftmost). However, to save space, we keep 
five digits in each node. (Long integer variables are used so that numbers as large as 
99999 may be kept in each node. The maximum size of an integer is implementation- 
dependent; therefore you may have to modify the routines to hold smaller numbers in 
each node.) We may declare the set of nodes by 


struct node { 
Tong int info; 
Struct node *next; 
is 
typedef struct node *NODEPTR; 


Since we wish to traverse the lists during the addition but wish to eventually 
restore the list pointers to their original values, we use circular lists with headers. 
The header node is distinguished by an info value of —1. For example, the integer 
4597634972 10698463 is represented by the list illustrated in Figure 4.5.4. 
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Figure 4.5.4 Large integer as a circular list. 


Now let us write a function addint that accepts pointers to two such lists repre- 
senting integers, creates a list representing the sum of the integers, and returns a pointer . 
to the sum list. Both lists are traversed in parallel, and five digits are added at a time. 
If the sum of two five-digit numbers is x, the low-order five digits of x can be extracted 
by using the expression x % 100000, which yields the remainder of x on division by 
100000. The carry can be computed by the integer division x/100000. When the end of 
one list is reached, the carry is propagated to the remaining digits of the other list. The 
function follows and uses the routines getnode and insafter. 


NODEPTR: addint(NODEPTR p, NODEPTR q) 
{ 
long int hunthou = 100000L; 
Jong int carry, number, total; 
NODEPTR s; 
/* set p and q to the nodes following the headers */ 
p = p->next; 
q = q->next; 
/* set up a header. node for the sum */ 
s = getnode(); 
s->info = -1; 
S->next = S; 
/* initially there is no carry */ 
carry = 0; 
while (p->info != -1 & q->info != -1) { 
/* add the info of the two nodes */ 
the and previous carry a 
total = p->info + q->info + carry; 
/* Determine the low order five digits of */ 
/* the sum and insert into the list. ¥/ 
number = total % hunthou; 
insafter(s, number); 
Nh advance the traversals +f 
S = S->next; 
p = p->next; 
q = q->next; 
/* determine whether there is a carry */ 
carry = total / hunthou; 
} /* end while */ 
/* at this point, there may be nodes left. in one of the */ 
is two input lists */ 
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while (p->info != -1) { 
total = p->info + carry; 
number = total % hunthou; 
‘insafter(s, number); 
carry = total / hunthou; 
S = S->next; 
p = p->next; 
} /* end while */ 
while (q->info != -1) { 
total = q->info + carry; 
number = total % hunthou; 
insafter(s, number); 
carry = total / hunthou; 
S = S->next; 
Q.= q->next; 
} /* end while */ 
/* check if there is an extra carry from the first */ 
a five digits Ed, 
if (carry == 1) { 
insafter(s, carry); 
S = S->next; 
} /* end if */ 
/* s points to the last node in the sum. s->next points to */ 
/* the header of the sum list. $y 
return(s->next); 
} /* end addint */ 


Doubly Linked Lists 


Although a circularly linked'list has advantages over a linear list, it still has sev- 
eral drawbacks. One cannot traverse such’a list backward, nor can a node be deleted 
from a circularly linked list, given only a pointer to that node. In cases where these 
facilities are required, the appropriate data structure is a doubly linked list. Each node 
in such a list contains two pointers, one to its predecessor and another to its succes- 
sor. In fact, in the context of doubly linked lists, the terms predecessor and succes- 
Sor are meaningless, since the list is entirely symmetric. Deubly linked lists may be 
either linear or circular and may or may not contain a header node, as illustrated in 
Figure 4.5.5. 

We may consider the nodes on-a doubly linked list to consist of three fields: an 
info field that contains the information stored in the node, and left and right fields that 
contain pointers to the nodes on either side. We may declare a set of such nodes using 
either the array or dynamic implementation, by 


Array Implementation Dynamic Implementation 


Struct nodetype { struct node { 
int info; ' int info; 
int left, right; struct node *left, *right; 
struct nodetype node[NUMNODES]; typedef struct node *NODEPTR: 
pk SOAR SLSR i e A a e 
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Figure 4.5.5 Doubly linked tists. 


Note that the available list for such a set of nodes in the array implementation 
need not be doubly linked, since it is not traversed bidirectionally. The available list 
may be linked together by using either the left or right pointer. Of course, appropriate 
getnode and freenode routines must be written. 

We now present routines to operate on doubly linked circular lists. A convenient 
property of such lists is that if p is a pointer to any node, letting /eft(p) be an abbreviation 


for node[ p].left or p—> left, and right(p) an abbreviation for node{ p).right or p-> 
right, we have i 


leřt(right(p)) = p= right(left(p)) 


One operation that can be performed on doubly linked lists but not on ordinary linked 
lists is to delete a given node. The following C routine deletes the node pointed to 
by p from a doubly linked list and stores its contents in x, using the dynamic node 


implementation. It is called by delete(p, &x). 
void delete(NODEPTR p, int *px) 
{ 


NODEPTR.q, r; 

if (p == NULL) { 
printf("void deletion\n"); 
return; 

} /* end if */ 
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*px = p->info; 
q = p->left; 

r = p->right; 
q->right = r; 
r->left = q; 
freenode(p); 
return; 

} /* end delete */ 


The routine insertright inserts a node with information field x to the right of 
node(p) in a doubly linked list: 


void insertright(NODEPTR p, int x) 
{ 


NODEPTR q, r; 

if (p == NULL) { 
printf("void insertion\n"); 
return; 

} /* end if */ 

q = getnode(); 

q->info = x; 

r = p->right; 

r->left = q; 

q->right = r; 

q->left = p; 

p->right = q; 

return; 

} /* end insertright */ 


A routine insertleft to insert a node with information field x to the left of node(p) in a 
doubly linked list is similar and is left as an exercise for the reader. 

When space efficiency is a consideration, a program may not be able to afford 
the overhead of_two pointers for each element of a list. There are several techniques 
fof compressing the left and right pointers of a node into a single field. For example, 
a single pointer field ptr in each node can contain the sum of pointers to its left and 
right neighbors. (Here, we are assuming that pointers are represented in such a way 
that arithmetic can be performed on them readily. For example, pointers represented by 
array indexes can be added and subtracted. Although it is illegal to add two pointers 
in C, many compilers will allow such pointer arithmetic.) Given two external pointers, 
p and q, to two adjacent nodes such that p == left(q), right(q) can -be computed as 
pirig) — p and left(p) can be computed as ptr(p) — q. Given p and q, it is possible to 
delete either node and reset its pointer to the preceding or succeeding node. It is also 
possible to insert a node to the left of node() or to the right of node(q) or to insert a 
node between nede(p) and node(q) and reset either p or q to the newly inserted node. 
In using such a scheme, it is crucial always to maintain two external pointers to two 
adjacent nodes in the list. 

Addition of l.ong Integers Using Doubly Linked Lists 


As an illustration of the use of doubly linked lists, let us consider extending the 
list implementation of long integers to include negative as well as positive integers. 
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The header node of a circular list representing a long integer contains an indication of 
whether the integer is positive or negative. 

‘To add a positive and a negative integer, the smaller absolute value must be sub- 
tracted from the darger absolute value and the result must be given the sign of the integer 
with the larger absolute value. Thus, some method is needed for testing which of two 
integers represented as circular lists has the larger absolute value. 

The first criterion that may be used to identify the integer with the larger absolute 
value is the length of the integers (assuming that they do not contain leading Os). The 
list with more nodes represents the integer with the larger absolute value. However, 
actually counting the number of nodes ‘nvolves an extra traversal of the list. Instead of 
counting the number of nodes, the count could be kept as part of the header node and ` 
referenced as needed. i 

However, if both lists have the same number of nodes, the integer whose most 
significant digit is larger has the greater absolute value. If the leading digits of both 
integers are equal, it is necessary to traverse the lists from the most significant digit to 
the least significant to determine which number is larger. Note that this traversal is in 
the direction opposite that of the traversal used in actually adding or subtracting two 
integers. Since we must be able to traverse the lists in both directions, aaeiy linked 
lists are used to represent such integers. ; 

Consider the format of the header node. In addition to a right and left pointer, the 
header must contain the length of the list and an indication of whether the number is 
positive or negative. These two pieces of information can be combined into a single 
integer whose absolute value is the length of the list and whose sign is the sign of 
the number being represented. However, in so doing, the ability to identify the header 
node by examining the sign of its info field is destroyed. When a positive integer was 
represented as a singly linked circular list, an info field of — 1 indicated a header node. 
Under the new representation, however, a header node may contain an info field such 
as 5 which is a valid info field for any other node in the list. - 

There are several ways to remedy this problem. One way is to add another field to 
each node to indicate whether or not it is a header node. Such a field could contain the 
logical value TRUE if the node is a header and FALSE if it is not. This means, of course, 
that each node would require more space. Alternatively, the count could be eliminated 
from the header node and an info field of — 1 would indicate a positive number and —2 
a negative number. A header node could then be identified by its negative info field. 
However, this would increase the time needed to compare two numbers, since it would 
be necessary to count the number of nodes in each list. Such space/time trade-offs are 
common in computing, and a decision must be made about which efficiency should be 
` sacrificed and which retained. 

In our case we choose yet a third option, which is to retain an external pointer to 
the list header. A pointer p can be identified as pointing to a header if it is equal to the 
original external pointer; otherwise node(p) is not a header. 

Figure 4.5.6 indicates a sample node and the representation of four integers as 
doubly linked lists. Note that the least significant digits are to the right of the header 
and that the counts in the header nodes do not include the header node itself. 

Using the preceding representation, we present a function compabs that compares 
the absolute values of two integers represented as doubly linked lists. Its two parameters 
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(b) The integer -3242197849762. 


ice 


(c) The integer 676941. 


Header 


Header 


(d) The integer 0 


Figure 4.5.6 Integers as doubly linkad lists. 


are pointers to the list headers and it returns | if the first has the greater absolute value, 
— 1 if the second has the greater absolute value, and 0 if the absolute values of the two 
integers are equal. 


int compabs(NODEPTR p, NODEPTR q) 
{ 


NODEPTR r, s; 
/* compare the counts */ 
if (abs(p->info) > abs(q->info)) 
return(1); uan 
“ if (abs(p->info) < abs(q->info)) 
return(-1); 
/* the counts are equal */ ° 
r = p->left; 
s = q->left; 
/* traverse the list from the most significant digits */ 
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} 


while (r != p) { en 
if (r->info > s->info) 
return(1); 
if (r->info < s->info) 
return(-1); 
r=r->left; 
S.= s->left; 
} /* end while */ 
/**the absolute values are equal */ 
return(0); E 
/* end compabs */ 


We may now write a function addiff that accepts two pointers to doubly linked 
lists representing long integers of differing signs, where the absolute value of the first is 


In this function, p points to the number with the larger absolute value and g points 
-to the number with the smaller absolute value. The values of these variables do not 
change. Auxiliary variables pptr and gptr are used to traverse the lists. The sum is 
‘formed in a list pointed to by the variable r. 


NODEPTR addiff(NODEPTR p, NODEPTR q) 


{ 
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int count; 

NODEPTR pptr, aptr, r, s, zeroptr; 
long int hunthou = 100000L; 

long int borrow, diff; 

int zeroflag; 

/*. initialize variables */ 

count = 0; 


„borrow = 0; ` 


zeroflag = FALSE; 
/* generate a header node for the sum */ 
r = getnode(); 
r->left = r; 
r->right = r; 
/* traverse the two lists 
pptr = p->right; 
aptr = q->right; 
while (qptr != q) { 
diff = pptr->info - borrow - qptr->info; 
if €diff >= 0) 
borrow = 0; 
else { 
diff = diff + hunthou; 
borrow = 1; 
} /* end if */ 
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/* generate a new node and insert it-*/ 
/* to the left of header in sum */ 
insertleft(r, diff); 
count += 1; 
/* test for zero node */ 
if (diff == 0) { 
if (zeroflag == FALSE) 
zeroptr = r->left; 
zeroflag = TRUE; 


else 
zeroflag = FALSE; 
pptr = pptr->right; 
gptr = qptr->right; 
} /* end while */ 
/* traverse the remainder of the p list */ 
while (pptr != p) { 
diff = pptr->info - borrow; 
if (diff >= 0) 
borrow = 0; 
else { s 
diff = diff + hunthou;: 
‘borrow = 1; 
} /* end if */ 
insertleft(r, diff); 
count += 1; 
if (diff == 0) { 
if (zeroflag == FALSE) 
zeroptr = r->left; 
zeroflag = TRUE; 
} 
else 
js -  Zeroflag = FALSE; 
pptr = pptr->right; 
} /* end while */ 
if (zeroflag == TRUE) /* delete leading zeros */ 
while (zeroptr != r) { 
s = zeroptr; 
zeroptr = zeroptr->right; 
delete(s, &diff); 
count -= 1; 
} /* end if...while */ 
/* insert count and sign into the header */ 
if (p->info > 0) 
r->info = count; 
else 
r->info = -count; 
return(r); 
} /* end addiff */ 
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We can also write a function addsame to add two numbers with like signs. This is 
very similar to the function addint of the previous implementation except that it deals 
with a doubly linked list and must keep track of the number of nodes in the sum. 

Using these routines we can write a new version of addint that adds two integers 
represented by doubly linked lists. : 


NODEPTR addint(NODEPTR p, NODEPTR q) 


{ 


/* check if integers are of like sign */ 
if (p->info * q->info > 0) 
return(addsame(p, q)); 
/* check which has a larger absolute value */ 
if (compabs(p, q) > 0) 
return(addiff(p, q)); 
else 
return(addiff(q, p)); 


} /* end addint */ 


EXERCISES 


4.5.1. 


4.5.2. 


4.5.3. 


4.5.4. 


4.5.5. 


4.5.6. 
4.5.7. 
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Write an algorithm and a C routine to perform each of the Dperations of Exercise 4.2.3 
for circular lists. Which are more efficient on circular lists than on linear lists? Which 
are less efficient? 


Rewrite the routine place of Section 4.3 to insert a new item in an ordered circular list. 


Write a program to solve the Josephus problem by using an array rather than a circular 
list. Why is a circular list more efficient? ; 

Consider the following variation of the Josephus problem. A group of people stand in a 
circle and each chooses a positive integer. One of their names and a positive integer n 
are chosen. Starting with the person whose name is choseñ, they count around the circle 
clockwise and eliminate the nth person. The positive integer that that person chose is 
then used to continue the count. Each time that a person is eliminated, the number that ` 
he or she chose is used to determine the next person eliminated. For example, suppose 
that the five people are A, B, C, D, and E and that they choose integers 3, 4, 6, 2, and 
7, respectively, and that the integer 2 is initially chosen. Then if we start from A, the 
order in which people are eliminated from the circle is B, A, E, C, leaving D as the last 
one in the circle. 


. Write a program that reads a group of input lines. Each input line except the first and last 


contains a name and a positive integer chosen by that person. The order of the names 
in the data is the clockwise ordering of the people in the circle, and the count is to start 
with the first name in the input. The first input line contains the number of people in 
the circle. The last input line contains only a single positive integer representing the 
initial count. The program prints the order in which the people are eliminated from the 
circle. 

Write a C function mulrint(p, q) to multiply two long positive integers represented by 
singly linked circular lists. 


Write a program to print the 100th Fibonacci number. 


Write an algorithm and a C routine to perform each of the operations of Exercise 4.2.3 
for doubly linked circular lists. Which are more efficient on doubly linked than on 
singly linked lists? Which are less efficient? 
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4.5.8. Assume that a single pointer field in each node of a doubly linked list contains the 
sum of pointers to the node's predecessor and successor, as described in the text. Given 
pointers p and q to two adjacent nodes in such a list, write C routines to insert a node to 
the right of node(q), to the left of node(p), and between node(p) and node(q) modifying 
p to point to the newly inserted node. Write an additional routine to delete node(q), 
resetting q to the node’s successor. 

4.5.9. Assume that first and last are external pointers to the arti and last nodes of a doubly 
linked list represented as in Exercise 4.5.8. Write C routines to implement the opera- 
tions of Exercise 4.2.3 for such a list. i 

4.5.10. Write a routine addsame to add two long integers of the same sign represented by 

i doubly linked lists. ` 

4.5.11. Write a C function multint(p, q) to multiply two long integers represented by doubly 
linked circular lists. 

4.5.12. How can a polynomial in three variables (x, y, and z) be represented by a circular list? 
Each node should represent a term and should contain the powers of x, y, and z as well 
as the coefficient of that term. Write C functions to do the following. 

(a) Add two such polynomials. 
(b) Multiply two such polynomials. 


(c) Take the partial derivative of such a polynomial with respect to any of its vari- 
ables. 


(d) Evaluate such a polynomial for given values of x, y, and z. 


(e) Divide one such polynomial by another, creating a quotient and a remainder poly- 
nomial. 


(f) Integrate such a polynomial with respect to any of its variables. 

(g) Print the representation of such a polynomial. 

(h) Given four such polynomials f (x,y,z), 8(x,y.z), h(x,y,z) and i(x,y,z), compute the 
polynomial f (g(x,y.z), h(x,y.z), i(x,y,z)). 


4.6 LINKED LISTS IN C++ 


We now examine the implementation of linked lists in C++. We will look at singly 
linked linear lists and leave the details of circular and doubly linked lists to the reader. 
Before going into details about lists in C++, we introduce the built-in C++ mech- 


anism for allocating and freeing objects of a given type. If T is the name of a type, then 
the expression 


new T 


creates a new object of type T and returns a pointer to the newly created object, If T is a 
class with a constructor with no parameters, then the object created is also automatically 
initialized. If T is a class with a constructor with n parameters, then the expression 


new T(pl, p2, ..., pn) 


creates an object of type T, initializes it using the constructor with parameters pl 
through pn, and returns a pointer to it. 


Sec. 4.6 Linked Lists in C++ 245 


If p points to an object created via use of the new operator, then the statement 


` delete p; 


deallocates the object to which p was pointing. If the type has a destructor, the destructor 
is invoked prior to the deallocation. 
Now we can turn to a discussion of lists in C++. We envision a linked list as a 
data structure, with a fixed set of public operations on the list. This means that the user 
accesses the list as a whole and is unable to access individual nodes within the list and 
individual pointers to those nodes. If a particular'operation is desired on the list, it must 
be included in the public interface of the list class. 


For example, the following might be the class definition for a linked list of inte- 
gers, with the following operations: 


1. Initialize a list to the empty list. This is a constructor, automatically invoked when 


2: 


a 
4. 


nN 


a list is defined or created. 


Free the nodes of a list. This is a destructor, automatically invoked when a list is 


freed or the block in which it is declared is exited. 
Determine whether a list is empty. 


Add a node with a given value into the list following the first node with another 


given value. 


The class definition follows: 


class List { 
protected: 
struct node { 
int info; 
struct node *next; 
} 
typedef struct node *NODEPTR; 
NODEPTR listptr; // the pointer to the first node 
// of the list 
public: 
ListQ; 
~List(); 
int emptylist(); 
void insertafter(int oldvalue, int newvalue); 
void push(int newvalue); 
void delete(int oldvalue); 
int pop(); 
} 


We now present the implementation of these routines: 
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. Add a node with a given value to the front of the list. This is the push operator. 
. Delete.the first node with a given value from the list. 
. Delete the first node from the list. This is thé pop operator. 
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List is a constructor that initializes a newly created list to the empty list. 


List::ListQ { 
listptr = 0; 
} s: 


Rh a 


~ List is the destructor that traverses the nodes of a list, freeing them`one by one. 


List::~List() { AFO. 
NODEPTR p, q; ; ii 
_ if (emptylist()) 
return 0; 
for (p = listptr, q = p->next; p!=0;-p=q,qe p- >next) 
delete p; a 
} ‘ 


émptylist determines if a list is empty. 


int List::emptylist() { ; 
return(listptr == 0); re 

} i ER e 

insertafter(oldvalue, newvalue) searches for the first occurrence of the value 


oldvalue in the list and inserts a new node with value newvalue following the node: . 
containing oldvalue. 


mR 


List::insertafter(int oldvalue, int newva ue) { 
NODEPTR p, q; 


for (p = listptr; p != 0 & p->info != oldvalue; p = p->next) 


if (p == 0) 
error("ERROR: value sought is not on the list. r); 
q = new node; 
q->info = newvalue; 
q->next = p->next; 
p->next = q; 
} $ 


push(newvalue) adds a new node with a given value to the front of the list. 


List::push(int newvalue) { 
NODEPTR p; 
p = new node; 
p->info = newalue; 
p->next = listptr; 
listptr = p; 
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detete(oldvalue) deletes.the first node containing the value oldvalue from the list. 


List::delete(int oldvalue) { 
NODEPTR p, q; 


for (q=0, p=listptr; p!=0 && p->info!=oldvalue; q=p, p=p->next) 


if (p == 0) è 
error("ERROR: value sought is not on the list."); 
if (q == 0) i à 
listptr=p->next; 
else 
q->next=p->next; 
delete p; 
} 


Finally, pop deletes the first node on the list and returns its contents. 


int List::pop() { 
NODEPTR p; 
int x; 
if (emptytlist()) 
error("ERROR: the list is empty."); 
p =listptr; i 
listptr = p->next; 
x = p->info; 
delete p; 
return x; 


} 


Note that the List class does not permit the user to manipulate the nodes of the list; 
everything must be done via a method of List on the entire list. 


EXERCISES 


4.6.1. Modify the List class so that it uses a template and can be instantiated to implement a 
list of any type, not just integer. What problems may occur if you instantiate a list of 
lists? 

4.6.2. Write a class OrderedList to implement a sorted list into which elements can only be 
inserted in their proper place. Can OrderedList be a descendant of List? 


4.6.3. Add a method insertafter2(int oldvalue, int n, int newvalue) that inserts a node with 
value newvalue after the nth occurrence of vldvalue. 


4.6.4. Write a class CircList to implement a circular list. 
4.6.5. Write a class DoubleList to implement a doubly linked list. 
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In this chapter we consider a data structure that is useful in many applications: the 
tree. We define several different forms of this data structure and:show how they can be 
represented in C and how they can be applied to solving a wide variety of problems. As 
with lists, we treat trees primarily as data structures rather than as data types. That is, 
we are primarily concerned with implementation, rather than mathematical definition. 


5.1 BINARY TREES 


A binary tree is a finite set of elements that is either empty or is partitioned into three 
disjoint subsets. The first subset contains a single element called the root of the tree. 
The other two subsets are themselves binary trees, called the left and right subtrees of 
the original tree. A left or right subtree can be empty. Each element of a binary tree is 
called a node of the tree. 

A conventional method of picturing a binary tree is shown in Figure 5.1.1. This 
tree consists of nine nodes with A as its root. Its left subtree is rooted at B and its right 
subtrée is rooted at C. This is indicated by the two branches emanating from A: to B 
on the left and to C on the right. The absence of a branch indicates an empty subtree. 
For example, the left subtree of the binary tree rooted at C and the right subtree of the 
binary tree rooted at E are both empty. The binary trees rooted at D, G, H. and I have 
empty right and left subtrees. 

Figure 5.1.2 illustrates some structures that are not binary trees. Be sure that you 
understand why each of them is not a binary tree as just defined. > 
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Figure 5.1.1 Binary tree. 


Figure 5.1.2 Structures that are not binary trees. 
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Figure 5.1.3 Strictly binary tree. 


If A is the root of a binary tree and B is the root of its left or right subtree, then A 
is said to be the father of B and B is said to be the left or right son of A. A node that has 
no sons (such as D, G, H, or 7 of Figure 5.1.1) is called a leaf. Node n1 is an ancestor 
of node n2 (and n2 is a descendant of n1) if n1 is either the father of n2 or “the father of 
some ancestor of n2. For example, in the tree of Figure 5.1.1, A is an ancestor of G, and 
H is a descendant of C, but E is neither an ancestor nor a descendant of C. A node n2 
is a left descendant of node n1 if n2 is either the left son of n1 or a descendant of the 
left son of n1. A right descendant may be similarly defined. Two nodes are brothers if 
they are left and right sons of the same father. 

Although natural trees grow with their roots in the ground and their leaves in the 
air, computer scientists almost universally portray tree data structures with the root at 
the top and the leaves at the bottom. The direction from the root to the leaves is “down” 
and the opposite direction is “up.” Going from the leaves to the root is called “climbing” 
the tree, and going from the root to the leaves is called “descending” the tree. 

If every nonleaf node in a binary tree has nonempty left and right subtrees, the tree 
is termed a strictly binary tree. Thus the tree of Figure 5.1.3 is strictly binary, whereas 
that of Figure 5.1.1 is not (because nodes C and E have one son each). A strictly binary 
tree with n leaves always contains 2n — 1 nodes. The proof of this fact is left as an 
exercise for the reader. 

The level of a node in a binary tree is defined as follows: The root of the tree has 
level 0, and the level of any other node in the tree is one more than the level of its father. 
For example, in the binary tree of Figure 5.1.1, node E is at level 2 and node H is at 
level 3. The depth of a binary tree is the maximum level of any leaf in the tree. This 
equals the length of the longest path from the root to any leaf. Thus the depth of the tree 
of Figure 5.1.1 is 3. A complete binary tree of depth d is the strictly binary tree all of 
whose leaves are at level d. Figure 5.1.4 illustrates the complete binary tree of depth 3. 

If a binary tree contains m nodes at level /, it contains at most 2m nodes at level 
1+1. Since a binary tree can contain at most one node at level 0 (the root), it can contain 
at most 2! nodes at level /. A complete binary tree of depth d is the binary tree of depth 
d that contains exactly 2! nodes at each level / between 0 and d. (This is equivalent to 
saying that it is the binary tree of depth d that contains exactly 2¢ nodes at level d.) The 
total number of nodes in a complete binary tree of depth d, tn, equals the, sum of tne 
number of nodes at each level between 0 and d. Thus 
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Figure 5.1.4 Complete binary tree of depth 3.- 


d 
ma Wp +2 tory #2! = 52 
j=0 


By induction, it can be shown that this sum equals 2¢*! — 1. Since all leaves in such a 
tee are at level d, the tree contains 24 leaves and, therefore, 24 — | nonleaf nodes. 

Similarly, if the number of nodes, im, in a complete binary tree is known, we can 
compute its depth, d, from the equation rn = 21+! — 1. d equals | less than the number 
of times 2 must be multiplied by itself to reach rn + 1. In mathematics, log,x is defined 
as the number of times b must be multiplied by itself to reach x. Thus we may say that, 
in a complete binary tree. d equals logo(tn + 1) — 1. For example, the complete binary 
tree of Figure 5.1.4 contains 15 nodes and is of depth 3. Note that 15 equals 2°+! — 1 
and that 3 equals log>(15 + 1) — 1. log2x is much smaller than x [for example, logs 1024 
equals 10 and log» 1000000 is less than 20}. The significance of a complete binary tree 
is that it is the binary tree with the maximum number of nodes for a given depth. Put 
another way, although a complete binary tree contains many nodes, the distance from 
the root to any leaf (the tree’s depth) is relatively small. 

A binary tree of depth d is an almost complete binary tree if: 


1. Any node nd at level less than d — | has two sons. 


2. For any node nd in the tree with a right descendant at level d, nd must have a left 
son and every left descendant of nd is either a leaf at level d or has two sons. 


The strictly binary tree of Figure 5.1.5a is not almost complete. since it contains leaves 
at levels 1, 2. and 3, thereby violating condition 1. The strictly binary tree of Figure 
5.1.5b satisfies condition 1, since every leaf is either at level 2 or at level 3. However, 
condition 2 is violated. since A has a right descendant at level 3 (J) but also has a left 
descendant that is a leaf at level 2 (E ). The strictly binary tree of Figure 5.1.5c satisfies 
both conditions 1 and 2 and is therefore an almost complete binary tree. The binary 
tree of Figure 5.1.5d is also an almost complete binary tree but is not strictly binary, 
since node E has a left son but not a right son. {We should note that many texts refer to 
such a tree as a “complete binary tree” rather than as an “almost complete binary tree.” 
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(a) th) 


(ce) 


(d) 


Figure 5.1.5 Node numbering for almost complete binary trees. 
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Still other texts use the term “complete” or “fully binary” to refer to the concept that 
we call “strictly binary.” We use the terms “strictly binary,” “complete,” and “almost 
complete” as we have defined them here.) 

The nodes of an almost complete binary tree can be numbered so that the root is 
assigned the'number 1, a left son is assigned twice the number assigned its father, and a 
right son is assigned one more than twice the number assigned its father. Figure 5.1.5c 
and d illustrate this numbering technique. Each node in an almost complete binary tree 
is assigned a unique number that defines the node’s position within the tree, 

An almost complete strictly binary tree with n leaves has 2n— 1 nodes, as does any 
other strictly binary tree with n leaves. An almost complete binary tree with n leaves 
that is not strictly binary has 2n nodes. There are two distinct almost complete binary 
trees with n leaves, one of which is strictly binary and one of which is not. For example, 
the trees of Figure 5.1.5c and d are both almost complete and have five leaves; however, 
the tree of Figure 5.1.5c is strictly binary, whereas that of Figure 5.1.Sd is not. 

There is only a single almost complete binary tree with n nodes. This tree is Strictly 
binary if and only if n is odd. Thus the tree of Figure 5.1.5c is the only almost complete 
binary tree with nine nodes and is. strictly binary because 9 is odd, whereas the tree of 
. Figure/5.1.5d is the only almost complete binary tree with ten nodes and is not strictly 

binary because 10 is even. 

An almost complete binary tree of depth d is intermediate between the complete 
binary tree of depth d — 1, that contains 2% — 1 nodes, and the complete binary tree of 
depth d, which contains 29+! — | nodes. If mn is the total number of nodes in an almost 
complete binary tree, its depth is the largest integer less than or equal to logytn. For 

‘example, the almost complete binary trees with 4, 5, 6, and 7 nodes have depth 2, and 
the almost complete binary trees with 8, 9, 10, 11, 12, 13, 14, and 15 nodes have depth 3. 


Operations on Binary Trees 


There are a number of primitive operations that can be applied to a binary tree. 
If p is a pointer to a node nd of a binary tree, the function info(p) returns the con- 
tents of nd. The functions left(p), right(p), father(p), and brother(p) return pointers 
to the left son of nd, the right son of nd, the father of nd, and the brother of nd, re- 
spectively. These functions return the null pointer if nd has no left son, right son, fa- 
ther, or brother. Finally, the logical functions isleft(p) and isright(p) return the value’ 
true if nd is a left or right son, respectively, of some other node in the tree, and false 
otherwise. y 

Note that the functions isleft(p), isright(p), and brother(p) can be implemented 


using the functions left(p), right(p) and father(p). For example, isleft may be imple- ` 
mented as follows: f 


q= father(p); 
if (q == null) 
return( false); /* p points to the root */ 
if (left(q)- == p). 
return(true); 
rezurn( false); 
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“or, even simpler, as father(p) && p == left father(p)). isright may be implemented 


- ina similar manner, or by calling isleft. brother( p) may be implemented using isleft or 
isright as follows: 


if (father(p) == null) 

return(nul1); /* p points to the root */ 
if (isleft(p)) 

return(right(father(p))); 
return(left( father(p))) i 


in constrieugg a binary tree, the operations maketree, setleft, and setright are use- 
ful. maketree(x) creates a new binary tree consisting of a single node with information 
field x and returns a pointer to that node. setleft( p,x) accepts ‘a pointer p to a binary’ 
tree node with no left son. It creates a new left son of node( p) with information field x. 
setright( p,x) is analogous to setleft except that it creates a right son of node( p). 


Applications of Binary Trees 


A binary treé is a useful data structure when two-way decisions must be made at 
each point in a process. For example, suppose that we wanted to find all duplicates in 
a list of numbers. One way of doing this is to compare each number with all those that 
precede it. However, this involves a large number of comparisons. 

The number of comparisons can be reduced by using a binary tree. The first num- 
ber in the list is placed in a node that is established as the root of a binary tree with 
empty left and right subtrees. Each successive number in the list is then compared to 
the number in the root. If it matches, we have a duplicate. If it is smaller, we examine 
the left subtree; if it is larger, we examine the -ight subtree. If the subtree is empty, tht 
number is not a duplicate and is placed into a new node at that position in the tree. If the 
subtree is nonempty, we compare the number to the contents of the root of the subtree 
and the entire process is repeated with the subtree. An algorithm for doing this follows. 


/* read the first number and insert it */ 
/* into a single-node binary tree */ 
scanf("%d", &number); ' 
tree = maketree(number); 
while (there are numbers left in the input) { 
scanf("%d", &number) ; 
p = q = tree; 
while (number != info(p) & q != NULL) { 
p= q; 
if Cite < info(p)) 
= left(p); 
els 
q = right(p); 
} /* end while */ 
if (number==info(p)) 
printf("%d %s\n", number, "is a duplicate"); 
/* insert number to the right or left of p */ 
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else if (number < info(p)) 
setleft(p, number); 
else f 
setright(p, number); 
} /* end while */ 


Figure 5.1.6 illustrates the tree constructed from the input 14, 15, 4, 9, 7, 18, 3, 
5, 16, 4, 20, 17, 9, 14, 5. : 

Another common operation is to traverse a binary tree; that is, to pass through the 
tree, enumerating each of its nodes once. We may simply wish to print the contents of 
each node as we enumerate it, or we may wish to process it in some other fashion. In 
either case, we speak of visiting each node as it is enumerated. 

The order in which the nodes of a linear list are visited in a traversal is clearly from 
first to last. However, there is no such “natural” linear order for the nodes of a tree. Thus, 
different orderings are used for traversal in different cases. We shall define three of these 
traversal methods. In each of these methods, nothing need be done to traverse an empty 
binary tree. The methods are all defined recursively, so that traversing a binary tree 
involves visiting the root and traversing its left and right subtrees. The only difference: 

-among the methods is the order in which these three operations are performed. 
To traverse a nonempty binary tree in preorder (also known as depth-first order), 
_ we perform the following three operations: 
1. Visit the root. f 
2. Traverse the left subtree in preorder. 
3. Traverse the right subtree in preorder. 


To traverse a nonempty binary tree in inerder (or symmetric order): 


1. Traverse the left subtree in inorder. 
2. Visit the root. 
3. Traverse the right subtree in inorder. 


Figure 5.1.6 Binary tree constructed for finding duplicates. 
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Preorder: ABDGCEHIF Q 
inorder: DGBAHEICF 
Postorder: GDBHIEFCA 


Preorder: ABCEIFJDGHKL 
Inorder: EICFJBGDKHLA 


Figure 5.1.7 Binary trees and their 
Postorder: /EJFCGKLHDBA - traversals, 


To traverse a nonempty binary tree in postorder: 


1. Traverse the left subtree in postorder. 
.2. Traverse the right subtree in postorder. 
3. Visit the root. 


Figure 5.1.7 illustrates two binary trees and their traversals in preorder, inorder, and 
postorder. . 

Many algorithms that use binary trees proceed in two phases. The first phase 
builds a binary tree, and the second traverses the tree. As an example of such an al- 
gorithm, consider ‘the following sorting method. Given a list of numbers in an input 
file, we wish to print them in ascending order. As we read the numbers, they can be 
inserted into a binary tree such as the one of Figure 5.1.6. However, unlike the previous 
algorithm used to find duplicates, duplicate values are also placed in the tree. When 
a number is compared with the contents of a node in the tree, a left branch is taken if 


« 
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Figure 5.1.8 Binary tree constructed for sorting. 


the number is smaller than the contents of the node and a right branch if it is greater or 
equal to the contents of the node. Thus if the input list is 


1415497 1835 16420179145 


the binary tree of Figure 5.1.8 is produced. 

Such a binary tree has the property that all elements in the left subtree of a node n 
are less than the contents of n, and all elements in the right subtree of n are greater than 
or equal to the contents of n. A binary tree that has this property is called a binary search 
tree. If a binary search tree is traversed in- inorder (left, root, right) and the contents of 
each node are printed as the node is visited, the numbers are printed in ascending order. 
Copvince yourself that this 1s the case for the binary search tree of Figure 5.1.8: Binary 
Search trees and their use in sorting and searching are discussed further in Sections 6.3 
and 7.2. 

As another application of binary trees, consider the following method of repre- 
senting an expression containing operands and binary operators by a strictly binary 
tree. The root of the strictly binary tree contains an operator that is to be applied to the 
results of evaluating the expressions represented by the left and right subtrees. A node 
representing an operator is a nonleaf, whereas a node representing an operand is a leaf. 
Figure 5.1.9 illustrates some expressions and their tree representations. (The character 
“$” is-again used to represent exponentiation.) 
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(b) (A +B) C 


(d) (A tB*C)SUA 4+ B)s¢) 


Figure 5.1.9 Expressions and their binary tree representation. 
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Let us see what happens when these binary expression trees are traversed., 
Traversing such a tree in preorder means that the operator (the root) precedes its two 
operands (the subtrees). Thus a preorder trav <rsal yields the prefix form of the expres- 
sion. (For definitions of the prefix and postfix forms of an arithmetic expression, see 
Sections 2.3 and 3.3.) Traversing the binary trees of Figure 5.1.9 yields the prefix forms 


+A * BC (Figure 5.1.9a) ` 

* + ABC (Figure 5.1.9b) 
+A*-—BC$D*EF (Figure 5.1.9c) 
$ +A*BC*+ABC (Figure 5.1.9d) 


Similarly, traversing a binary expression tree in postorder places an operator after 
its two operands, so that a postorder traversal produces the postfix form of the expres- 
sion. The postorder traversals of the binary trees of Figure 5.1.9 yield the postfix forms 


ABC * + (Figure 5.1.9a) 
AB + C* (Figure 5.1.9b) 
ABC — DEF *$ * + (Figure 5.1.9c) 
ABC * +AB+C*$ (Figure 5.1.9d) 


What happens when a binary expression tree is traversed i. inorder? Since the 
root (operator) is visited after the nodes of the left subtree and before the nodes of the 
right subtree (the two operands), we might expect an inorder traversal to yield the infix 
form of the expression. Indeed, if the binary tree of Figure 5.1.9a is traversed, the infix 
expression A + B * C is obtained. However. a binary expression tree does not contain 
parentheses, since the ordering of the operations is implied by the structure of the tree. 
Thus an expression whose infix form requires parentheses to ovérnde explicitly the 
conventional precedence rules cannot be retrieved by a simple inorder traversal. The 
inorder traversals of the trees of Figure 5.1.9 yield the expression». 


A+ B*C (Figure 5.1.9a) 
A+B*C (Figure 5.1.9b) 
A+B>~C*DS*E*F (Figure 5.1.9c) 
A+B*xC$A+B*C (Figure 5.1.9d) 


which are correct except for parentheses. 


EXERCISES ^ 


5.1.1. Prove that the root of a binary tree is an ancestor of every node it the tree except itself. 
5.1.2. Prove that a node of a binary tree has at most one father. 
5.1.3. How many ancestors does a node at level 7 in a binary tree have? Prove your answer. 
5.1.4. Write recursive and nonrecursive algorithms to determine: 

(a) The number of nodes in a binary tree 

(b) The sum of the contents of all the nodes in a binary tree 

(c) The depth of a binary tree 
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5.1.5. Write an algorithm to determinc if a binary tree is , 
(a) Strictly binary N 
(b) Complete 
(c) Almost complete 

5.1.6. Prove that a strictly binary tree with 7 leaves contains 2n — 1 nodes. 


5.1.7. Given a strictly binary tree with n leaves. let /evel(i) for i between 1 and n equal the 
level of the ith leaf. Prove that 


n 
MILJA =| 
2 Keve 
i=l. 


§.1.8.° Prove that the —* an almost complete strictly binary tree with 7 leaves can be 
numbered from | to 2n.- 1 in such a way that the number assigned to the left son 
of the node numbered i is 2i and the number assigned to the right son of abs node. 
numbered / is 2i + 1. 

5.1.9. Two binary trees are similar if they are both empty or if they are both nonempty, their 
left subtrees are similar, and their right subtrees are similar. Wnte an algorithm to 
determine if two binary trees are similar, 

5.1.10. Two binary trees are mirror similar if they are both empty or if they are both nonempty 
and the left subtree of each is mirror similar to the right subtree of the other. Write an 
algorithm to determine if two binary trees are mirror similar. , 

5.1.11. Write algorithms to determine whether or not one binary tree is similar and mirror 
similar (see the previous exercises) to some subtree of another. 

5.1.12. Develop an algorithm to find duplicates in a list of numbers without using a binn 
tree. If there are n distinct numbers in the list. how many times must two numbers be 
compared for equality in your algorithm? What if all n numbers are equal? 

5.1.13. (a) Write an algorithm that accepts a pointer to a binary search tree and deletes the 

smallest element from the tree. ' 

(b) Show how to implement an ascending priority queue (see Section 4.1) as a binary 
search tree. Present algorithms for the operations pginsert and pqmindelete on a 
binary search tree. 

5.1.14. Write an algorithm that accepts a binary tree representing an expression and returns the 
infix version of the expression that contains only those parentheses that are necessary. 


5.2 BINARY TREE REPRESENTATIONS 


In this section we examine various methods of implementing binary trees in C and 
present routines that build and traverse binary trees. We also present some additional 
applications of binary trees. 


Node Representation of Binary Trees 

As is the case with list nodes, tree nodes may be implemented as array elements 
or as allocations of a dynamic variable. Each node contains info. left, right, and father 
fields. The /eft, right, and father fields of a node point to the node’s left son, right son, 


and father, respectively. Using the array implementation, we may declare 
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#define NUMNODES 500 
struct nodetype { 
int info; 
int left; 
int right; 
int father; 
}; 
struct nodetype node [NUMNODES]; 


Under this representation, the operations info(p), left(p), right(p), and 
father(p) are implemented by references to node|p).info, node p).left, node[p].right, 
and node(p).father, respectively. The operations isleft(p), isright(p), and brother(p) can 
be implemented in terms of the operations /eft(p), right(p), and father(p), as described 
in the preceding section. 

To implement is/eft and isright more efficiently, we can also include within each 
node an additional flag isleft. The value of this flag is TRUE if the node is a left son and 
FALSE otherwise. The root is uniquely identified by a NULL value (—1) in its father 
field. The external pointer to a tree usually points to its root. 

Alternatively, the sign of the father field could be negative if the node is a left son 
or positive if it is a right son. The pointer to a node’s father is then given by the absolute 
value of the father field. The isleft or isright operations would then need only examine 
the sign of the father field. 

To implement brother(p) more efficiently, we can also include an additional 
brother field in each node. 

Once the array of nodes is declared, we could create an available list by executing 
the following statements: 


int avail, i; 


{ 
avail = 1; 
for (i=0; i < NUMNODES; i++) 
node[iJ].left = i + 1; 
node[NUMNODES-1]. left = 0; 


The functions getnode and freenode are straightforward and are left as exercises. Note 
that the available list is not a binary tree but a linear list whose nodes are linked together 
by the Jeff field. Each node in a tree is taken from the available pool when needed and 
returned to the available pool when no longer in use. This representation is called the 
linked array representation of a binary tree. 
“Alternatively, a node may be defined by 
struct nodetype { 
int info; 
_ Struct nodetype *left; 
struct nodetype *right; 
struct nodetype.*father; 


i 
typedef struct nodetype *NODEPTR; 
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The operations info(p), left(p). right(p), and father(p) would be implemented by refer- 
ences to p— >info, p— >left. p— >right, and p— >father, respectively. Under this im- 
plementation, an explicit available list is not needed. The routines getnode and freenode 
simply allocate and free nodes using the routines malloc and free. This representation 
is called the dynamic node representation of a binary tree. 

Both the linked array representation and the dynamic node representation are im- 
plementations of an abstract linked representation (also called the node representa- 
tion) in which explicit pointers link together the nodes of a binary tree. 

We now present C implementations of the binary tree operations under the dy- 
namic node representation and leave the linked array implementations as simple exer- 
cises for the reacer. The makerree function, which allocates a node and sets it as the 
root of a single-node binary tree, may be written as follows: 


NODEPTR maketree(int x) 
{ 
NODEPTR p; 


p = getnode() ; 
p->info = x; 
p->left = NULL; 
p->right = NULL; 
return(p); j 
- } /* end maketree */ 


The routine setleft( p,x) sets a node with contents x as the left son of node( p): 
void setleft(NODEPTR p, int x) 
{ 


if (p == NULL) 
printf("void insertion\n"); 
else if (p->left != NULL) 
printf ("invalid insertion\n"); 
else 
p->left = maketree(x); 
} /* end setleft */ 


The routine serright( p,x) to create a right son of node( p) with contents x is similar 
and is left as an exercise for the reader. 

It is not always necessary to use father, left, and right fields. If a tree is always 
traversed in downward fashion (from the root to the leaves), the father operation is 
never used; in that case, a father field is unnecessary. For example, preorder, inorder, 
and postorder traversal do not use the father field. Similarly, if a tree is always traversed 
in upward fashion (from the leaves to the root), /eft and right fields are not needed. The 
isleft and isright operations could be implemented even without left and right fields 
by using a signed pointer in the father field under the linked array representation, as 
discussed earlier: a right son contains a positive father value and a left son a negative 
father field. Of course, the routines maketree, setleft, and setright must then be suitably 
modified for these representations. Under the dynamic node representation, an is/eft 
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logical field is required in addition to father if left and right fields are not present and 
it is desired to implement the isleft or isrighi operations. i 

The following program uses a binary search tree to find duplicate numbers in 
an input file in which-each number is on a separate input line. It closely follows the 
algorithm of Section 5.1. Only top-down links are used: therefore no father field is 
needed. i mane eres 


struct nodetype { 
int. info;* 
struct nodetype “left; 
struct nodetype *right; 
}; 


typedef struct nodetype *NODEPTR; 


main() 

{ 

NODEPTR ptree; 
NODEPTR p, q; 
int number; 


scanf ("%d", &number); 
ptree = maketree(number); 
while (scanf("%d", &number) != EOF) { 
p = q = ptree; 
while (number != p->info && q != NULL) { 
p=q 
if (number < p->info) 
q = p->left; 
else 
q = p->right; 
} /* end while */ 
if (number == p->info) 
printf("%d is a duplicate\n", number); 
else if (number < p->info) 
setleft(p, number); 
else 
setright(p, number); 
} /* end while */° 
} /* end main */ 


Internal and External Nodes 


By definition leaf nodes have no sons. Thus, in the linked representation of 
binary trees, left and right pointers are needed only in nonleaf nodes. Sometimes 
two separate sets of ‘nodes are used for nonleaves and leaves. Nonleaf nodes con- 
tain info, left, and right fields (often no information is associated with nonleaves. 
so that an info field is unnecessary) and are allocated as dynamic records or as an 
array of records managed using an available list. Leaf nodes do not contain a left or 
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right field and are kept as a single info array that is allocated sequentially as needed 
(this assumes that leaves are never freed, which is often the case). Alternatively, they 
can be allocated as dynamic variables containing only an info value. This saves a great 
deal of space, since leaves often represent a majority of the nodes in:a binary tree. Each 
(leaf or nonleaf) node can also contain a father field, if necéssary. f 

When this distinction is made between nonleaf a by ha nonleaves are 
ca’ ed infernal nodes and leaves ate called external nodes. The terminology is also 
often used even when only a single type Piode is defined. Of course, a son pointer 
within an internal node must be identified as pointing to an internal or an external. node. 
This can be done in C in two ways. One technique is to declare two different node types 
and pointer types and to use a union for internal nodes, with each alternative containing 
one of the two pointer types. The other technique is to retain a single type of pointer 
and a single type of node, where the node is a union that does (if the node is an internal 
node) or does not (if an external node) contain left and right pointer fields. We wil! see 
an example of this latter technique at the end of this section. 


Imp‘icit Array Representation of Binary Trees n à 

Recall from Section 5.1 that the n nodes of an almost complete binary tree can 
be numbered from 1 to n, so that the number assigned a left son is twice the number 
assigned its father, and the number assigned a right son is | more than twice the number 
assigned its father. We can represent an almost complete binary tree without father, left, 
or right links. Instead, the nodes can be kept in an array info of size n. We refer to the 
node at position p simply as “node p.” info[ p} holds the contents of node p. 

In C, arrays start at position 0; therefore instead of numbering the tree nodes from 
l ton, we number them from 0 to n — 1. Because of the one-pesition shift, the two sons 
of a node numbered p are in positions 2p + 1 and 2p + 2, instead of 2p and 2p + 1. 

The root of the tree is at position 0, so that rree, the external pointer to the tree 
root, always equals 0. The node in position p (that is, node p) is the implicit father 
of nodes 2p + 1 and 2p + 2. The left son of node p is node 2p + I and its right son 
is node 2p + 2. Thus the operation /eft(p) is implemented by 2 * p +.1 and right(p) 
by 2 * p + 2. Given a left son at position p, its right brother is at p + l and, given 
a right son at position p, its left brother is at p — 1. father(p) is implemented by 
(p — 1) 2. p points to a left son if and only if p is odd. Thus, the test for whether node 
p is a left son (the isleft operation) is to check whether p%2 is not equal to 0. Figure 
5.2.1 illustrates arrays that represent the almost complete binary trees of Figure 5.1.5c 
and d. i 

We can extend this implicit array representation of almost complete binary trees 
to an implicit array representation of binary trees generally. We do this by identifying 
an almost complete binary tree that contains the binary tree being represented. Figure 
5.2.2a illustrates two (non-almost-complete) binary trees, and Figure 5.2.2b illustrates 
the smallest almost complete binary trees that contain them. Finally, Figure 5.2.2c 
illustrates the implicit array representations of these almost complete binary trees, and, 
by extension, of the original binary trees. The implicit array representation is also called 
the sequential representation, as contrasted with the linked representation presented 
earlier, because it allows a tree to be implemented in a contiguous block o! memory (an 
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array) rather than via pointers connecting widely separated nodes. Under the sequen- 
tial representation, an array element is allocated whether or not it serves to contain 
node of a tree. We must, therefore, flag Unused array elements as nonexistent, or null, 
tree nodes. This may be accomplished by one of two methods. One method is to set 
info[p] to a special value if node p is null. This special value should be invalid as the 
information content of a legitimate tree node. For example, in a tree containing posi- 
tive numbers, a null node may be indicated by a negative info value. Alternatively, we 
may add a logical flag field, used, to each node. Each node then contains two fields: 
info and used. The entire structure is contained in an array node. used(p), implementea 
as node[p].used, is TRUE if node p is not a null node and FALSE if it is a null noge. 
info(p) is implemented by node[p].info. We use this latter method in implementing the 
sequential representation. __ 

We now present the program to find duplicate numbers in an input list, as well 


as the routines -maketree and setleft, using the sequential representation of binary 
trees. f 


#define NUMNODES 500 
struct nodetype { 
int info; 
int used; 
} node [NUMNODES] ; 


void maketree(int); 
void setleft(int, int); 
void setright(int, int); 


main() 
{ 
int p, q, number; 


scanf("%d", &number) ; 
maketree (number) ; 
while (scanf("%d", &number) != EOF) { 
p=q=0; j 
while (q < NUMNODES & node[q].used & number != node[p].info) { 
p =q; 
if (number < node[p].info) 
q=2*p+1; 
else 
q=2*p+2; 
} /* end while */ 
/* if the number is in the tree it is a duplicate */ 
if (number == node[p]. info) 
printf("%d is a duplicate\n", number); 
else if (number < node[p] .info) 
setleft(p, number); 
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else 
setright (p, number); 
} /* end while */ 
} /* end main */ 


void maketree(int x) 


{ 
int p; 


node[0].info = x; 
node[0].used = TRUE; 
/* The tree consists of node 0 alone. */ 
/* All other nodes are null nodes = */ 
for (p=1; p < NUMNODES; p++) 
node[p].used = FALSE; 
} /* end maketree */ 


void setleft(int p, int x) 


{ 
int q; 


q=2*p+1; /* Q is the position of the left son */ 
if (q >= NUMNODES) 
error("array overflow"); 
else if (node[q].used) 
error("invalid insertion"); 
else { 
node[q].info = x; 
node[q].used = TRUE; 
} -/* end if */ 
} /* end setleft */ 


The routine for setright is similar. 

Note that under this implementation, the routine maketree initializes the fields info 
and used to represent a tree with a single node. It is no longer necessary for maketree to 
return a value, since under this representation the single binary tree represented by the 
info and used fields is always rooted at node 0. That is the reason that p is initialized 
to 0 in the main function before we move down the tree. Note also that under this 
representation it is always required to check that the range (VUMNODES) has not been 
exceeded whenever we move down the tree. 


Choosing a Binary Tree Representation 


Which representation of binary trees is preferable? There is no general answer 
to this question. The sequential representation is somewhat simpler, although it is nec- 
essary to ensure that all pointers are within the array bounds. The sequential representa- 
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tion clearly saves storage space for trees known to be almost complete, since it elim- 
inates the need for the fields teft, right, and father and does not even requite a used 
field. It is also space efficient for trees that are only a few nodes short of being al- 
most complete, or when nodes are successively eliminated from a tree that originates 
as almost complete, although a used field might then be required. However, the se- 
quential representation can only be used in a context in which only a single tree is re- 
quired, or where the number of trees needed and each of their maximum sizes is fixed in 
advance. 

By contrast, the linked representation requires left, right, and father fields (al- 
though we have seen that one or two of these may be eliminated in specific situations) 
but allows much more flexible use of the collection of nodes. In the linked represen- 
tation, a particular node may be placed at any location in any tree, whereas in’the se- 
quential representation a node can be utilized only if it is needed at a specific location 
in a specific tree. In addition, under the dynamic node representation the total num- 
ber of trees and nodes is limited only by the amount of available memory. Thus the 
linked representation is preferable in the general, dynamic situation of many trees of 
unpredictable shape. 

Thé duplicate-finding program is a good illustration of the trade-offs involved. 
The first program presented utilizes the linked representation of binary trees. It requires 
left and right fields in addition to info (the father field was not-necessary in that pro- 
gram). The second duplicate-finding program that utilizes the sequential representation 
requires only an additional field, used (and this too can be eliminated if only positive 
numbers are allowed in the input, so that a null tree node can be represented by a spe- 
cific negative info value). The sequential representation can be used for this example 
because only a single tree is required. 

However, the second program might not work for as many input cases as the 
first. For example, suppose that the input is in ascending order. Then the tree formed 
by either program has all null left subtrees (you are invited to verify that this is the 
case by simulating the programs for such input). In that case the only elements of info 
that are occupied under the sequential representation are 0. 2, 6. 14, and so on (each 
position is two more than twice the previous one). If the value of NUMNODES is kept 
at 500, a maximum of only 16 distinct ascending numbers can be accommodated (the 
Jast one will be at position 254). This can be contrasted with the program using the 
linked representation, in which up to 500 distinct numbers in ascending order can be 
accommodated before it runs out of space. In the remainder of the text, except as noted 
otherwise, we assume the linked representation of a binary tree. 


„Binary Tree Traversals in C 


We may implement the traversal of binary trees in C by recursive routines thet 
mirror the traversal definitions. The three C routines pretray, intrav. and posttray print 
the contents of a binary tree in preorder, inorder. and postorder, respectively. The pa- 
rameter to each routine is a pointer to the root node of a binary tree. We use the dynamic 
node representation of a binary tree: 
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void pretrav(NODEPTR tree) 
{ 


if (tree '= NULL) { 
printf("%d\n", tree->info); 
pretrav(tree->left); 
pretrav(tree->right); 
} /* end if */ 
} /* end pretrav */ 


void intrav(NODEPTR tree) 
{ 
if (tree != NULL) { 
jntrav(tree->] eft); 
printf("%d\n", tree->info); 
intrav(tree->right) ; 
} /* end if */ 
} /* end intrav */ 


void posttrav(NODEPTR tree) 
{ . 
if (tree != NULL) { 
posttrav(tree->left) ; 
posttrav(tree->right); - 
printf("%d\n", tree->info); 
} /* end if */ 
. } /* end posttrav */ 


/* visit the root */ 
/* traverse left subtree */ 
/* traverse right subtree */ 


bad 

/* traverse left subtree */ 
/* visit the root a} 

/* traverse right subtree */ 


/* traverse left subtree */ 
/* traverse right subtree */ 
/* visit the root */ 


The reader is invited to simulate the actions of these routines on the trees of Figures 
5.1.7 and 5.1.8. 
Of course, the routines could be written nonrecursively to perform the necessary 


stacking and unstacking explicitly. For example, the following is a nonrecursive routine 
to traverse a binary tree in inorder: 


#define MAXSTACK 100 


void intrav2(NODEPTR tree) 


{ 
struct stack { 
int top; 
NODEPTR item[MAXSTACK] ; 
$ sS; 
NODEPTR p; 
s.top = -1; 
p = tree; 
do { 
/* travel down left branches as far as possible */ 
y" saving pointers to nodes passed 3 
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while (p != NULL) { 
push (s, p); . 
p = p->left; 

.} /* end while */ 

/* check if finished */ 

if (tempty(s)) { 
/* at this point the left subtree is empty */ 
p = pop(s); 
printf("%d\n", p->info); /* visit the root */ 
p = p->right; /* traverse right subtree */ 

} /* end if */ 

} while (!empty(s) || p != NULL); 
} /* end intrav2 */ 


Nonrecursive routines to traverse a binary tree in postorder and preorder as well 
as nonrecursive traversals of binary trees using the sequential representation are left as 
exercises for the reader. 

intrav and intrav2 represent an excellent contrast between a recursive routine and 
its nonrecursive counterpart. If both routines are executed, the recursive intrav generally 
executes much more quickly than the nonrecursive intrav2. This goes against the ac- 
cepted “folt wisdom” that recursion is slower than iteration. The primary cause of the in- 
efficiency of intrav2 as written is the calls to push, pop, and empty. Even when the code 
for these functions is inserted in-line into intrav2, intrav2 is still slower than intrav 
because of the often superfluous tests for overflow and underflow included in that code. 

Yet, even when the underflow/overflow tests are removed, intrav is faster than 
intrav2 under a compiler that implements recursion efficiently! The efficiency of the 
recursive process in this case is due to a number of factors: 


1. There is no “extra” recursion, as there is in computing the Fibonacci numbers, 
where f(n — 2) and f(n— 1) are both recomputed separately even though the value 
of f(n — 2) is used in computing f(n — 1). l l 

2. The recursion stack cannot be entirely eliminated, as it can be in computing the 
factorial function. Thus the automatic stacking and unstacking of built-in recur- 
sion is more efficient than the programmed version. (In many systems, stacking 
can be accomplished by incrementing the value of a register that points to the 
stack top and moving all parameters into a new data area in a single block move. 
Program-controlled stacking as we have implemented it requires individual as- 
signments and increments.) 

3. There are no extraneous parameters and local variables, as there are, for exampie, 
in some versions of binary search. The automatic stacking of recursion does not 
“stack any more variables than are necessary. 


In cases of recursion that do not involve this excess baggage, such as inorder 
traversal, the programmer is well advised to use recursion directly. 

The traversal routines that we have presented are derived directly from the def- 
initions of the traversal methods. These definitions are in ‘terms of the left and right 
sons of a node and do not reference a node’s father. For that reason, both the recursive 
and nonrecursive routines do not require a father field and do not take advantage of such 
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a 
a field even if it is present. As we shall soon see, the presence of a father field allows 
us to develop nonrecursive traversal algorithms without using a stack. However, we , - 
first examine a technique for eliminating the stack in a nonrecursive traversal even if a 
father field is not available. ae 


Threaded Binary Trees 4 : 

Traversing a binary tree is a common operation, and it would be helpful to find 
a more efficient method for implementing the traversal. Let us examine the function 
intrav2 to discover the reason that a stack is needed. The stack is popped when p equals 
NULL. This happens in one of two cases. In one case, the while loop is exited after 
having been executed one or more: times. This implies that the program has traveled - 
down left branches until it reached a NULL pointer, stacking a pointer to each node 
as it was passed. Thus, the top element of the stack is the value of p before it became 
NULL. If an auxiliary pointer g is kept one step behind p, the value of g can be used 
directly and need not be popped. 

The other case in which p is NULL is that in which the while loop is skipped 
entirely. This occurs after reaching a node with an empty right subtree, executing the 
statement p = p— >right, and returning to repeat ihe body of the do while loop. At this 
point, we would have lost our way were it not for the stack whose top points to the node 
whose left subtree was just traversed. Suppose, however, that instead of containing a 
NULL pointer inits right field, a node with an empty right subtree contained in its right 
field a pointer to the node. that would be on top of the stack at that point in the algorithm 
(that is, a pointer to its inorder successor.) Then there would no longer be a need for the 
stack, since the last node visited during ‘a traversal of a left subtree points directly to its 
inorder successor. Such a pointer is called a thread and must be differentiable from a 
tree pointer that is used to link a node to its left or right subtree. 

Figure.5.2.3, shows the binary trees of Figure 5.1.7 with threads repiaciig NULL 
pointers in nodes with empty right subtrees. The threads are drawn with dotted lines 
to differentiate them from tree pointers. Note that the rightmost node:in each tree still 
has a NULL right pointer, since it has no inorder successor. Such trees are called right 

in-threaded binary trees. f 

To implement a fight in-threaded binary tree under the dynamic node implemen- 
tation of a binary tree, an extrailogical field, rthread, is included within each node to 
indicate whether or not its right pointer is a thread. For consistency, the rthread field of 
the rightmost node ofa tree (that is, the last node in the tree’s inorder traversal) is also 
set to TRUE, although its right field remains NULL. Thus a node is defined as follows 
(recall that we are assuming that no father field exists): 


struct nodetype { 
int info; 
Struct nodetype *left; /* pointer to left son */ 
Struct nodetype *right; -/* pointer to right son */ 


int rthread; ; /* thread is TRUE if */ 
/* „right is NULL or */ 


} /* a -mon-NULL thread +/ 
typedef struct nodetype *NODEPTR; 3 
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Figure 5.2.3 Right in-threaded binary trees. 


We present a routine to implement inorder traversal of a right in-threaded binary 


tree, 


void intrav3(NODEPTR tree) 
{ 


NODEPTR p, q; 


274 Trees Chap. 5 


do { 
; a = NULL; 
whRen(Gesaaby e p Derai arene”. 
q =p; a 
p = p->left; . 


} /* end while */_ 
if (Q t= NULL) { 
printt("%d\n", q->into); 
p = q->right; 
while (q->rthread & p != NULL) { 
printf ("Xd\n | proturays 
Q=?; 
p = p->right; 
J p^ etd mnie y 
} /* end if */ 
} while (a != NULL) 
} /* end intrav3 */ 


In a right in-threaded binary tree the inorder successor of any node car be found 
eficivimiy: Gach waroe can alse ve conserneted in a straightforward manner. The routines 
maketree, setleft, and setright are as follows. We assume info. left, right, and rthread 
fields in each node. 


NODEPTR maketree(int x) 
{ 
NODEPTR p; 


p = getnode(); 
p->info=x; 
p->left = NULL; 
p->right = NULL; 
prt unea = mur, 
return(p); 

} /* end maketree */ 


void setleft(NODEPTR p, int x) 


POREOT ONY 
if (p == NULL) 
error("void insertion"); 
else if (p->left != NULL) 
error("invalid insertion”); 


else { 
q = getnode(); 
g->info = x; 
p->left = q; 


q->left = NULL; 
/* The inorder successor of node(q) is node(p) */ 
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Q->right = p; 
q->rthread = TRUE; 
} /* end if */ 
} /* end setleft */ 


‘void setright(NODEPTR p, int x) 
{ 
NODEPTR q, r; 


if (p == NULL) 
error("void insertion") 

else if (!p->rthread) 
error("invalid insertion"); 

else { 
q = getnode(); 
q->info = x; 
/* save the inorder successor of node(p) wf 
r = p->right; 
p->right = q; 
p->rthread = FALSE; 
.q->left, = NULL; 
/* The inorder successor of node(q) is the */ 
/* previous successor of node(p) +] 
q->right = r; 
q->rthread = TRUE; 

} /* end else */ 

} /* end setright */ 


In the linked array implementation, a thread can be represented by a negative 
value of node[p].right. The absolute value of node(p].right is the index in the array 
node of the*inorder successor of node[p]. The sign of node[p}.right indicates whether 
its absolute value represents a thread (minus) or a pointer to a nonempty subtree (plus). 
Under this implementation, the following routine traverses a right in-threaded binary 
tree in inorder. We leave mgketree, setleft, and setright for the linked array representa- 
tion as exercises for the reader. 


void intrav4(int tree) 
{ 
int p, q; 
p = tree; 
do { e 
/* travel down left links keeping q behind p */ 
q=0; . ‘ 
while (p != 0) { 
q= p; 


. p= node[p]. left; 
} /* end while */ 
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if (q != 0) { /* check if finished */ 
printf("%d\n", node[q]. info); è 
p = node[q].right; ža 
while (p < 0) { 
q = -pi 
printf("%d\n", node[q]. info); 
p = node[q].right; 
} /* end while .*/ 
} /tendif*7 `~ 
/* traverse right subtree */ 
} while (q != 0); 
} /* end intrav4 */ 


ra 


Under the sequentiaf representation of binary trees, the used field indicates 
threads by means of negative or positive values. If i represents a node with a right 
son, node|i].used equals 1, and its right son is:at 2 * i + 2. However, if i represents a 
node with no right son, Aode[i].used contains the negative of the index of its inorder 
successor. (Note that use of negative numbers allows us to distinguish a node with a 
right son from a node whose inorder successor is the root of the tree.) If i is the rightmost 
node of the tree, so that it has no inorder successor, node[i}.used can contain the special 
value +2. If i does not represent a node, node[i].used is 0. We leave the implementation 
of traversal algorithms for this representation as an exercise for the reader. 

A left in-threaded binary tree may be defined similarly, as one in which each 
NULL left pointer is altered to contain a thread to that node’s inorder predecessor. An 
in-threaded binary tree may then béefined as a binary tree that is both left in-threaded 
and right in-threaded. However, left in-threading does not yield the advantages of right 
in-threading. i : l 

We may also define right and left pre-threaded binary trees, in which NULL ri ght 
and left pointers of nodes are replaced by their preorder successors and predecessors 
respectively. A right pre-threaded binary tree may be traversed efficiently in preorder 
without the use of a go A right in-threaded binary tree may also be traversed in 

u 


preorder without the ust of a stack. The traversal algorithms are left as exercises for the 
reader. 


Traversal Using a father Field 


If each tree node contains“a father field, neither a stack nor threads are necessary 
for nonrecursive traversal. Instead, when the traversal process reaches a leaf node, the 
Sather field can be used to climb back up the tree. When node(p) is reached from a left 
son, its right subtree must stiJ] be traversed; therefore the algorithm proceeds to right(p). 
When node(p) is reached from its right son, both its subtrees have been traversed and the 


algorithm backs up further to father(p). The following routine implements this process 
for inorder traversal. 


‘void intravS(NODEPTR tree) 
{= 


NODEPTR p, q; 
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while (p != NULL) { i 
q =p; 
p = p->left; 
} /* end while */ 
if (q != NULL) { 
printf("%d\n", q->info); 
p = q->right; 
} /* end if */ 
while (q != NULL & p == NULL) { 
do { 
` /* node(q) has no right son. Back up until a */ 
/* left son or the tree root is encountered */ 
p=4q; 
q = p->father; 
} while (!isleft(p) && q != NULL); 
if (q != NULL) { 
printf("%d\n", q->info); 
p = q->right; 
} /* end if */ 
} /* end while */ 
} while (q != NULL); 
} /* end intravS */ 


Note that we write isleft(p) rather than p— >isleft because an isleft field is unnec- 
essary to determine if node(j) is a left or a right son; we can simply check if the node 
is its father’s left son. 

In this inorder traversal a node is visited [printf (“%d\ n”, q— >info)] when its 
left son is recognized as NULL or when it is reached after backing up from its left son. 
Preorder and postorder traversal are similar except that, in preorder, a node is visited 
only when it is reached on the way down the tree and, in postorder, a node is visited 
only when its right son is recognized as NULL or when it is reached after backing up 
from its right son. We leave the details as an exercise for the-reader. 

Traversal using father pointers for backing up is less time efficient than traversal 
of a threaded tree. A thread points directly to a node’s successor, whereas a whole series 
of father pointers may have to be followed to reach that successor in an unthreaded tree. 
It is difficult to compare the time efficiencies of stack-based traversal and father-based 
traversal, since the former includes the overhead of stacking and unstacking,. 

This backup traversal algorithm also suggests a stackless nonrecursive traversal 
technique for unthreaded trees, even if no farher field exists. The technique is simple: 
simply reverse the son pointer on the way down the tree so that it can be used to find a 
way back up. On the way back up, the pointer is restored to its original value. 

For example, in intravS, a variable f can be introduced to hold a pointer to the 
father of node(q). The statements $s 

q =p; 

p = p->left; 
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in the first while loop can be replaced by 


f = q; 

q =p; 

p = p->left; 

if (p != NULL) 
q->left = f; 


This modifies the left pointer of node(q) to point to the father of node(q) when going 
left on the way down [note that p points to the left son of node(q), so that we have not 
lost our way]. The statement 

P = g->right; 
in both of its.occurrences can be replaced by 

p = q->right? ` 

if (p != NULL) 

q->right = f; 


to similariy modify the right pointer of node(q) to point to its father when going right 
on the way down. Finally, the statements 


pP=4q; 
q = p->father; 


in the inner do-while loop can be replaced by 


P =q; 

q=f; 

if (q != NULL & isleft(p)) {, j 
f = left(q); : 
left(q) = p; - ) ; 

else { 


f= right(q); 
right(q) = p; 
} /* end if */ 


to follow a modified pointer back up the tree and restore the pointer’s value to point to 
its left or right son as appropriate. ; 

However, now an isleft field is required, since the isleft operation cannot be im- 
plemented using a nonexistent father field. Also, this algorithm cannot be used in à 
multiuser environment if several users require access to the tree simultaneously. If one 
user is traversing the tree and temporarily modifying pointers, another user will be un- 
able to use the tree as a coherent structure. Some sort of lockout mechanism is required 
to ensure that no one else uses the tree while pointers ure reversed. ` ? 
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Figure 5.2.4 Binary tree representing 3 + 4*(6 — 7) 5+3. 


y 


Heterogeneous Binary Tr 


Often the information contained in different nodes of a binary tree is not all of the 
same type. For example, in representing a binary expression witłfconstant numerical 
operands we may wish to use a binary tree whose leaves contain numbers but whose 
nonleaf nodes contain characters representing operators, Figure 5.2.4 illustrates such a 
binary tree. 

To represent such a tree in C we may use a union to represent the information por- 
tion of the node. Of course, each tree node must contain within itself a field to indicate 
the type of object that its info field contains. 


#define OPERATOR 0 
#define OPERAND 1 
struct nodetype { 
Short int utype; /* OPERATOR or OPERAND Ry 
union { 
char chinfo; 
k float numinfo; 
} info; 
struct nodetype *left; 
struct nodetype *right; 
}; 
i typedef struct nodetype *NODEPTR; j 
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Let us write a C function evalbintree that accepts a pointer to such a tree and 
returns the value of the expression represented by the tree. The function recursively 
evaluates the left and right subtrees and then applies the operator of the root to the two 
results. We use the auxiliary function oper (symb,opndl,cpnd2) introduced in Section 
2.3. The first parameter of oper is a character Tepresenting an operator, and the last two 
parameters are real numbers that are the two operands. The function oper Feturns the 
result of applying tie operginr ta thetwonnerande === nF PEK 


float evalbintree (NODEPTR tree) 
{ 


} /* end evalbintree */ 


hy 


- float opndl, opnd2; | A i mn% 
char symb; 


if (tree->utype ==. OPERAND) /* expression is a single operand * 
return (tree->numinfo); — . 3 

/* tree->utype == OPERATOR 4/ 

/* evaluate the left subtree _ */ 

opndl = evalbintree(tree->left); 

/* evaluate the right subtree */ 

opnd2 = evalbintree(tree->right); 

symb = tree->chinfo; /* extract the operator */ 

/* apply the operator and return the result ay 

return(oper(symb, opndl, opnd2)); 


& 


Section 9.1 discusses additional metheds of implementing linked Structures that 
contain heterogeneous elements. Note also that, in this example, all the operand nodes 
è. are leaves and all the operator nodes are nonleaves. 


EXERCISES 


5.2.1. 


5.2.2. 


5.2.4. 


5.2.5. 


i Exercises 


d- 
j 
Riy 4 


Write a C function that accepts a pointer to a node and returns TRUE if that node is the 
root of a valid binary tree and FALSE otherwise. s 

Write a C function that accepts a pointer to a binary tree and a pointer to a node of the 
tree and returns the level of the node in the'tree. 

Write a C function that accepts a pointer to a binary tree and returns a pointer to a new 
binary tree that is the mirror image of the first (that is, all left subtrees are now right 
subtrees and vice versa). 

Write C functions that convert a binary tree implemented ising the linked array rep- 
resentation with only a father field (in which the left son’s father field contains the 
negative of the pointer to its father and a right son's father contains a pointer to its 
father) to its representation using left and right fields, and vice versa. 

Write a C program to perform the following experiment: Generate 100 random num- 
bers. As each number is generated, insert it into an initially empty binary search tree. 
When all 100 numbers have been inserted, print the level of the leaf with the largest 
level and the level of the leaf with the smallest level. Repeat this process 50 times. Print 
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5.2.6. 
5.2.7. 


5.2.8. 


5.2.9. 


5.2.10. 


out a table-with a count of how many of the 50 runs resulted in a difference between 

the maximum and minimum leaf level of 0, 1, 2, 3, and soon. 

Write C routines to traverse a binary tree in preorder and postorder. 

Implement inorder traversal, maketree, setleft, and setright for right inethreaded binary 

trees under the sequential representation. 

Write C functions to efeate a binary tree given: f 

(a) The preorder and inorder traversals of that tree 

(b) Thé preorder and postorder traversals of that tree 

Each function should accept two character strings as parameters. The tree created 

Should contain a single character in each node. 

The solution to the Towers of Hanoi problem for n disks (see Sections 3.3 and 3.4) can 

be represented by a complete binary tree of level n — 4 as follows. 

(a) Let the root of the tree represent a move of the top disk on peg frompeg to peg 
topeg. (We ignore the identification of the disks being moved, as there is only 
2 single disk [the top one] that can be moved from any peg to any other peg.) 
If nd is a leaf node (at level less than n — 1) representing the movement of the 
top disk from peg x to peg y, let z be the third peg that is neither the source or 
target of node nd. Then /efi(nd) represents a move of the top disk from peg x to 
peg z and right(nd) répresents.a move of the top disk from peg z to peg y. Draw 
sample solution trees as described previously for n = _1,2,3, and 4, and show that 
an inorder traversal of such a tree produces the solution to the Towers of Hanoi 
problem. , “=” 

(b) Write a reċürsive C procedure that accepts a value for n and generates and tra- 
verses the tree as discussed previously. 

(c) Because the tree is complete, it can be stored in an array of size 2”—1. Show 
that the nodes of the tree can be stored in the array so that a sequential traversal 
of the array produces the inorder traversal of the tree, as follows: The root of 
the tree is in position 2"~'—1; for any level j, the first node at that level is in 
position 2”~'-/—] and each successive node at level jis 2”~/ elements beyond 
the previous element at that level. 

(d) Write a nonrecursive C program to create the array as described in part c and 
show that a sequential pass through the array does indeed produce the desired 
solution. 

(e) How could the preceding programs be extended to include within each node the 
number of the disk being moved? ag Na 

In Section 4.5 we introduced a method of representing a doubly linked list with only a 
single pointer field in each node by maintaining its value as the exclusive or of pointers 
to the node's predecessor and successor. A binary tree can be similarly maintained by 
keeping one field in each node: set to the exclusive or of pointers to the node's father 
and left son [call this field fleft(p)] and another field in the node set to the exclusive or 
of pointers to the node's father and right son [call this field Sright(p)). 

(a) Given father(p} and fleft(p), show how to compute left(p). 

-~ Given father(p) and fright(p), show how to compute right(p). 

(b) Given fleft(p) and left(p), show how to.compute father(p). 
Given fright(p) and right(p), show how to compute father(p). 

(© Assume that a node contains only info. left, fright. and isleft fields. Write algo- 
rithms for preorder. inorder. and postorder traversal ofsa binary tree, given an 
external pointer to the tree root, without using a stack or modifying any fields. 

(d) Can the isleft field be eliminated? 


a Ae 
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5.2.11. An index of a textbook consists of major terms ordered alphabetically. Each major 
term is accompanied by a set of page numbers and a set of subterms. The subterms are 
printed on successive lines following the major term and are arranged alphabetically 

~ within the major term. Each subterm is accompanied by a set of page numbers. 
Design a data structure to represent such an index and write a C program to print an 
index from data as follows: Each input line begins with an m (major term) or an s 
(subterm). An m line contains an m followed by a major term followed by an integer 
n (possibly 0) followed by n page numbers where the major term appears. An s line 
is similar except that it contains a subterm rather than a major term. The input lines 
appear in no particular‘order except that each subterm is considered to be a subierm 
of the major term which last precedes it. There may be many input lines for a single 
major term or subterm (all page numbers appearing on any line for a particular term - 
should be printed with that tèrm). ` r 
The index should be printed with one term on a line followed by all the pages on which 
the term appears in ascending order. Major terms should be printed in alphabetical 
order. Subterms should appear in alphabetical order immediately following their major 
term. Subterms should be indented five columns from the major terms. 
The set of major terms should be organized as a binary tree. Each node in the tree 
contains (in addition to left and right pointers and the major term itself) pointers to 
two other binary trees. One of these represents the set of page numbers in which the 
major term occurs, and the other represents the set of subterms of the major term. Each 
node on a subterm binary tree contains (in addition to left and right pointers and the 
subterm itself, a pointer to a binary tree representing the set of page numbers in which 
the subterm occurs. 


Y 
5.2.12. Write a C function to implement the sorting method of Section 5.1 that uses a binary 
search tree. ‘ 

5.2.13. (a) Implement an ascending priority queue using a binary search tree by writing 
C implementations of the algorithms pginsert.and pgmindelete, as in exercise 

5.1.13. Modify the routines to count the number of tree nodes accessed, 

(b) Use a random number generator to test the efficiency of the priority queue im- 
plementation as follows: First, create a priority queue with 100 elements by 
inserting 100 random numbers in an initially empty binary search tree. Then 
call pqmindelete and print the number of tree nodes accessed in finding the min- 
imum element, generate a new random number, and call pqinsert to insert the 
new random number and print the number of tree nodes accessed in the inser- 
tion. Note that after calling pqinsert, the tree still contains 100 elements. Repeat 
the delete/print/generate/inservprint process 1000 times. Note thatthe number 
of nodes accessed in the deletion tends to decrease, while the number of nodes 
accessed in the insertion tends to increase. Explain this behavior.’ 


5.3 EXAMPLF: THE HUFFMAN ALGORITHM 


Suppose that we have an alphabet of n symbols and a long message consisting of sym- 

bols from this alphabet. We wish to encode the message as a long bit string (a bit is 

either 0 or 1) by assigning a bit string code to each symbol of the alphabet and con- 

catenating the individual codes of the symbols making up the message to produce an 

encoding for the message. For example, suppose that the alphabet consists of the four 
. symbols A, B, C, and D and that codes are assigned to these symbols as follows: 
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A 010 > 
TAN B, 100 
a” E 000 
D ili 


tt 
‘ 


The message ABACCDA would then be encoded as 010100010000000111010. Such 
an encoding ‘is inefficient, since three bits are used for each symbol, so that 21 bits are 
needed to encode the entire message. Suppose that a two-bit code is assigned to each 
symbol, as follows: © |. ` ‘ 


—— 


Symbol Code 

elpiety ial ty 
A 60 
B 01 
Cou. 10 
D “ait 


Then the code for the message would be 00010010101 100, which requires only 14 bits. 
We wish to find a code that minimizes the length of the encoded message. 

Let us reexamine the above example. Each of the letters B and D appears 
only once in the message, whereas the letter A appears three times. If a code is 
chosen so that the letter A is assigned a shorter bit string than the letters B and D, 
the length of the encoded message would be small. This is because the short code 
(representing the letter A) would appear more frequently than the long code. Indeed; 
codes can be assigned as follows: a 


Symbol Code 
A 0 
B 110 
Og 10 
D HH 


Using this code, the message ABACCDA is encoded as 0110010101110. which requires 
only 13 bits. In very long messages containing symbols that appear very infrequently, 
the savings are substantial. Ordinarily. codes are not constructed on the basis of the fre- 
quency of characters within a single message alone, but on the basis of their frequency 
within a whole set of messages, The same code set is then used for each message. For 
example, if messages consist of English words, the known relative frequency of occur- 
rence of the letters of the alphabet in the English language might be used, although the 
relative frequency of the letters in any single message is not necegsarily the same. 

If variable-length codes are used. the code for one symbol may not be a prefix of 
the code for another. To see why, assume that the code for a symbol x, c(x), were a prefix 


S 
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of the code of another symbol y, cQ). Then when c(x) is encountered in a left-to-right 
scan, it is unclear whether c(x) represents the svnbol x or whether it is the first part 
of c(y). rs: 

In our example, decoding proceeds by scanning d bit string from left to right. If a 
0 is encountered as the first bit, the symbol is an A; otherwise it is a B, C, or D, and the 
next bit is examined. If the second bit is a 0, the symbol is a C; otherwise it must be a 
Bora D, and the third bit must be examined. I thirdbit isa 0, the symbol is a B; if 
it is a 1, the symbol is a D. As soon as the first symbol has been identified, the process 
is repeated starting at the next bit to find-the second symbol. 

This suggests a method for developing an optimal encoding scheme, given the 
frequency of occurrence of each symbol in a message. Find'the two symbols that ap- 
pear least frequently. In our example, these are B and D. The last bit of their codes 
differentiates one from the other: 0 for B and 1 for D: Combine these two symbols 
into the single symbdl BD, whose code represents the knowledge that a symbol is ei- 
ther a B or a D. The frequency of occurrence of this new symbol is the sum of the 
frequencies of its two constituent symbols. Thus the frequency of BD is;2. There are 
now three symbols: A (frequency 3), C (frequency 2) and BD (frequency 2). Again 
choose the two symbols with smallest frequency: C and BD. The last bit of their codes 
again differentiates one from the other: 0 for-C-and L fot BD. The two symbols are 
then combined into the single symbol-CBD with frequency 4. There are now only two 
symbols remaining: A and CBD. These are combined into the single symbol ACBD. 
The, last bits of the codes for A and CBD differentiate one from the other: O-for A and | 
for CBD. j 
The symbol ACBD contains the entire alphabet; it is assigned the null bit string of 
length Q.as its code. At the start of the decoding, before any bits have been examined, 
it is certain that any symbol is contained in ACBD. The two.symbols that make up 
ACBD (A and CBD) are assigned the codes 0 and 1, respectively. If a 0 is encountered, 
the: encoded symbol is an A; if a 1 is encountered, it is a C, a B, or a D. Similarly, 
the two symbols that ‘constitute CBD (C and BD) are assigned the codes 10 and 11, 
respectively. The first bit indicates that the symbol is one of the constituents of CBD, 
and the second bit indicates whether it is a C or a BD. The. symbols that make up 
BD (B and D) are then assigned the codes 110 and 111. By this process, symbols that 
appear frequently in the message are assigned shorter codes than symbols that appear 
infrequently. 

The action of combining two symbols into one Suggests the use of a binary tree. 
Each node of the tree represents a symbol and each leaf represents a symbol of the orig- 
inal alphabet. Figure 5.3.1a shows the binary tree constructed using the previous ex- 
ample. Each node in the illustration contains a symbol and its frequency. Figure 5.3.1b 
shows the binary tree constructed by this method for the alphabet and frequency table of 
Figure 5.3. 1c. Such trees are called Huffman trees after the discoverer of this encoding 
method. ; 

Once the Huffman tree is constructed, the code of any symbol in the alphabet 
can be constructed by Starting at the leaf representing that symbol and climbing up 
to the root. The code is initialized to null. Each time that a left branch is climbed, 0 
is appended to the beginning of the code; each time that a right branch is climbed, 1 is 
appended to the beginning of the code. 
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Figure 5.3.1 Huffman trees. 
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The Huffman Algorithm 


The inputs to the algorithm are n, the number of symbols in the original alphabet, 
and frequency, an array of size at least n such that frequency{i) is the relative frequency 
‘of the ith symbol. The algorithm assigns values to an array code of size at least n, so that 
code(i) contains the code assigned to the ith symbol. The algorithm also constructs an 
array position of size at least n such that position|i] points to the node representing the 
ith symbol. This array is necessary to identify the point in the tree from which to start 
in ccastructing the code for a particular symbol in the alphabet. Once the tree has been 
constructed, the isleft Operation vo * gi earlier can be used to determine whether 0 
» or 1 should be placed at the front of the code as we climb the tree, The info portion of 
a tree node contains the frequency of the occurrence of the symbol represented by that 
node, 

A set rootnodes is used to keep pointers to the roots of partial binary trées that 
are not yet left or right subtrees. Since this set is modified by removing elements with 
minimum frequency, combining them and then reinserting the combined element into 
the set, it is implemented as an ascending priority queue of pointers, ordered by the 
value of the info field of the pointers’ target nodes. We use the operations pqinsert, to 
insert a pointer into the Priority queue, and pgmindelete, to remove the pointer to the 
node with the smallest info value from the priority queue. 

We may outline Huffman’s algorithm as follows: 


/* initialize the set of root nodes */ 
rootnodes = the empty ascending priority queue; 
/* construct a node for each symbol */ 


for (i = 0; i < n i++) { 
p = maketree( frequency[i}); 
position{i] = py./* a pointer to the leaf containing */ 
«= ff the ith symbol */ 
pqinsert(rootnodes, p); 
} /* end for */ 
while (rootnodes contains more than one item) { 
pl = pqmindelete(rootnodes) ; 
p2 = pqmindelete(rootnodes) ; 
/* combine pl and p2 as branches of a single tree */ 
p = maketree(i#fo(pl) + info(p2)); 
setleft(p, pi); 
Setright(p, p2); 
pginsert(rootnodes, p); 
} /* end while */ 


/* the tree is now Constructed; use it to find codes */ 
root = pgmindelete(rootnodes) ; 
for (i =0; i <n; i+) { 

p = position[i]}; 

code[i] = the null bit string; 
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while (p != root) { 
/* travel up the tree */ 
if Cisleft(p)) 
code[i] = 0 followed by code[i]; 
else ; 
code[i] = 1 followed by code[i]; 
p = father(p); 
} /* end while */ 
} /* end for */ é 


<C Program k 

Note that the Huffman tree is strictly binary. Thus, if there are n symbols in the 
alphabet, the Huffman tree (which has n leaves) can be represented by an array of 
nodes of size 2n — 1. Since the amount of storage needed for the tree is known, it may 
be allocated in advance in an array node. 

In constructing the tree and obtaining the codes, it is only necessary to keep a 
» link from each node to its father’and an indication of whether each node is a left or a 

right son; left and right fields are unnecessary. Thus each node contains three fields: 
father, isleft, and freq. father is a pointer to the node’s father. If the node is the root, its 
father field is NULL. The value of isleft is TRUE if the node is a left son ànd AALSE 
otherwise. freq (which corresponds to the info fieid of the algorithm) i is the frequency 
of occurrence of the symbol represented by that node. 

We allocate the array node based on the maximum possible symbols (a constant 
maxsymbs) rather than on the actual number of symbols, n. Thus the array node, that 
should be of size 2n — 1, is declared as being of size 2 * MAXSYMBS — 1. This means 
that some space is wasted. Of course, n itself could be made a constant rather than a 
variable, but then the program must be rhodified every time that the number of symbols 
differs. The nodes can also le represented by dynamic variables without wasting space. 
However, we present a linked array implementation. (We could also input the value of 
n and allocate arrays of the proper size using malloc dynamically during execution. 
Then, no space would be wasted using an array implementation.) 

In using the linked array implementation, node[0] through noda + Jj can be 
reserved for the leaves representing the original n symbols of the alphabet, And node[n] 
through node[2 * n — 2} for the n — 1 nonleaf nodes required by the strctly binary 
tree. This means that the array position is not required as a guide to thé leaf nodes 
representing the n symbols, since the node containing the ith input symbol (where i 
goes from 0 to n — 1) is known to be node{i]. If the dynamic node repfesentation were 
used, the array position would be required. 

+» The following program encodes a message using Huffman’s algorithm. The input 
consists of a number n, which is the number of symbols in the alphabet, followed by a 
setof n pairs, each of which consists of a symbol and its relative frequency. The program 
first constructs a string alph, consisting ofall the-symbols in the alphabet, and an array 
code such that code[i] is the code assigned to the ith symbol in alph. The program then 
prints each character, its relative frequency and its code. 

Since the code is constructed from right to left, we define a structure codetype as 
follows: 
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#define MAXBITS 50 


Struct codetype { 
‘int bits [MAXBITS); 
int startpos; 


MAXBITS is the maximum number of bits allowed in a code. If a code cd is null, 
cd.startpos is equal to MAXBITS. When a bit b is added to cd at the left, cd.startpos is 
decremented by 1 and cd:bits[cd.startpos] is set to b. When the code cd is completed, 
the bits of the code are in positions cd.startpos through MAXBITS — 1 inclusive. . 

An important issue is how to organize the priority queue of root nodes. In the 
algorithm, this data structure was represented as a priority queue of node pointers. Im- 
plementing the priority queue by a linked list, as in Section 4.2, would require a new set 
of nodes, each holding a pointer to a root node and a next field. Fortunately the father 
field of a root node is unused, so that it can be used to link together all the root nodes 
into a list. The pointer rootnodes could point to t:> first root node on the list. The list 
itself can be ordered or unordered, depending on the implementation of pginsert and 
pqmindelete. ù 4 

We make use of this technique in the following program, which implements the - 
algorithm just presented. $ 


#define MAXBITS 50 
#define MAXSYMBS 50 
#define MAXNODES 99  /* MAXNODES equals 2*MAXSYMBS-1 */ 


struct codetype { 
int bits [MAXBITS]; 
int startpos; 


struct nodetype { 
int freq; 
int father; /* . If node[p] is not a root node, father points */ 
/* to the node's father; if- it is, father points +/ 
/* to the next root node in the priority queue t/ 
int isleft; 
k- 


void pqinsert(int, int); 
int pqmindelete(int); 


.. main() 
{ 
struct codetype cd, code[MAXSYMBS] ; 
struct nodetype node [MAXNODES]; 
int i, k, n, p, pl, p2, root, rootnodes; 
char symb, alph[MAXSYMBS}; 
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for (i = 0; i < MAXSYMBS; i++) 
alph[i] =" '; 

rootnodes = 0; - ; Š 

/* input the alphabet and frequencies . */ 

scanf("%d"; &n); : 

for (i = 0; i <n; i+) { E 
scanf("%s Xd", &symb, &node[i]. freq); 
Pqinsert(rootnodes, i); 
alph[{i] = symb; 

} /* end for. */ 


/* we now build the trees */ 
for (p = n; p < 2*n-1; n+) { j 
/* p-points to the next available node. Obtain the */ 
/* root nodes pl and p2 with smallest frequencies */ 
pl = pqmindelete(rootnodes) ; 
p2 = pqmindelete(rootnodes) ; 
/* set left(p) to pl and right(p) to p2 */ 
node[p1].father = p; 
node[pl].isleft = TRUE; 
node[p2].father = p; 
node[p2].isleft = FALSE; 
node[p].freq = node[p1].freq + node[p2]. freq; 
Pqinsert(rootnodes, p); f 
} /* end for */ 
/* There is now only one node left d 
/* witha null father field  +*/ 
root = pqmindelete(rootnodes) ; 
/* extract the codes from the tree 27 
for (i = 0; i <n; i+) { s 
/* initialize code[i] */ 
cd.startpos = MAXBITS; 
/* travel up the tree */ 
p=i; 
while (p != root) { 
--cd.startpos; 
if (node[p] -isleft) 
cd.bits[cd.startpos] = 0; 
else 
cd.bits(cd.startpos] = 1; 
p = node[p]. father; 
} /* end while */ 
for (k = cd.startpos; k < MAXBITS; k++) 
Se code[i].bits[k] = cd.bits[k]; 
.code[i].startpos = cd.startpos; 
} -/* end for */ 
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/* print results */ 

for (i = 0; i < n; i++) { 
printf("\n%c %d ", alph[i], nodes[i]. freq); 
for (k = code[i].startpos; k < MAXBITS; k++) 

printf("%d", code[i]. bitafki); 

printf("\n"); 

f /* end for */ 

} /* end main */ 


We leave to the reader the coding of the routine encode(alph, code, msge, bitcode). 
This procedure accepts the string alph, the array code constructed in the foregoing pro- 
gram, and a message msge and sets bitcode to the bit string encoding of the message. 
Given the encoding of a message and the Huffman tree used in constructing the 
code, the original message can be recovered as follows: Begin at the root of the tree. 
Each time that a 0 is encountered, move down a left branch, and each time that a | is 
encountered, move down a right branch. Repeat this process until a leaf is encountered. 
The next character of the original message : ‘he symbol that corresponds to that leaf. 
See if you can decode 1110100010111011 using the Huffman tree of Figure 5.3.1b. 
To decode it is necessary to travel from the root of the tree down to its Jeaves. 
This means that instead of father and isleft fields, two fields left and right are needed 
to hold the left and right sons of a particular node. It is straightforward to compute the 
fields.left and right from the fields father and isleft. Alternatively, the values left and 
. right can be constructed directly from the frequency information for the symbols of the. 
alphabet using an approach similar to.that used in assigning the value of father, (Of 
course, if the trees are to be identical, the symbol/frequency pairs must be presented 
in the same order under the two methods.) We leave these Aiporithme,; as well as the 
coud algorithm, as exercises for the reader. - 


EXERCISES 


5.3.1. Write a C function encode(alph, code, msge, bitcode). The function accepts the string 

alph and the array code produced by the program findcode in the text and a message 
é msge. The procedure sets bitcode to the Huffman encoding of that message. 

5.3.2. Write a C function decode(alph, left, right, bitcode, msge), in which alph is the string 
produced by the program findcode in the text, left and right are arrays used to represent a 
Huffman tree, and bitcode is abit string. The function sets msge to the Huffman decpding 
of bitcode. Ms 


15.3.3. „Įmplement the priority queue rootnodes as an ordered list. Write appropriate pees 
“and pqmindelete routines. 

4.3.4. Is it possible to have two different Huffman trees for a set of symbols with given fre- 
quencies? Either give an example in which two such trees exist or prove that there is 
only a single such tree. 

§,3.5. ‘Define the Fibonacci binary tree of order n as follows: If n = 0 or n = 1, the tree 
consists of a single node. If n > 1, the tree consists of a root, with the Fibonacci tree of 
order n — 1 as the left subtree and the Fibonacci tree of order n — 2 as the right subtree. 


(a) .Write a C function that returns a pointer to the Fibonacci binary tree of order r. 
(b) . Is such a tree strictly binary? 
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(c) . What is the number of leaves in the Fibonacci tree of order n? 
(d) What is the depth of the Fibonacci tree of order n? 

5.3.6. Given a binary tree 1, its extension is defined as the binary tree e(t) formed.from 1 by 
adding a new leaf node at each NULL left and right pointer in f. The new leaves are 
called external nodes, and the original nodes (which are now all nonleaves) are called 
internal nodes. e(t) is called an extended binary tree. 

(a) Prove that an extended binary tree is strictly binary. 

(b) If thas n nodes, how many nodes does e(t) have? 

(c) Prove that all leaves in an extended binary tree are newly added nodes. 

(d) Write aC routine that extends a binary tree t. 

(e) , Prove that any strictly binary tree with more than one node is an extension of one 
and only one binary tree. 

(f) Write aC function that accepts a pointer to a strictly binary tree fl containing more 

i than one node and deletes nodes from #1 creating a binary tree 12 such that 1] = 
e(12). 

(g) Show that the complete binary tree of order n is the nth extension of the binary 
tree consisting of a single node. 

5.3.7, Given a strictly binary tree ¢ in which the n leaves are labeled as nodes 1 through n, let 
level(i) be the level of node i and let freq(i) be an integer assigned to node i. Define the 
weighted path length of t as the sum of freq(i) * level(i) over all leaves of r. 

(a) Write aC routine to compute the weighted path length, given fields freg and father. 
(b) Show that the Huffman tree is the strictly binary tree with minimum weighted path 
length. 


5.4 REPRESENTING LISTS AS BINARY TREES 


Several operations can be performed on a list of elements. Included among these op: 
erations are adding a new element to the front or rear of the list, deleting the existing 
first or last element of the list, retrieving the kth element or the last element of the lis, 
inserting an element following or preceding a given element, deleting a given elemert, 
and deleting the predecessor or successor of a given element. Building a list with givn 
elements is an additional operation that is frequently required. 

Depending on the representation chosen for a list, some of these operations nay 
or may not be possible with varying degrees of efficiency. For example, a list nay. 
be represented by successive elements in an array or as nodes in a linked structre. 
Inserting an element following a given element is relatively efficient in a linkedlist 
(involving modifications to a few pointers aside from the actual insertion) but ela- 
tively inefficient in an array (involving moving all subsequent elements in the tray 
one position). However, finding the kth element of a list is far more efficient n an 

` atray (involving only the computation of an offset) than in a linked structure(that 
requires passing through the first k — 1 elements), Similarly, it is not possille to 
delete a-specific element in a singly linked linear list given only a pointer t that 
element, and it is only possible to do so inefficiently in a singly linked circulr list 
(by traversing the entire list to reach the previous element, and then performig the 
deletion). The same operation, however, is quite efficient in a doubly linked (liear or 
circular) list. 

In this section we introduce a tree representation of a linear list in which he op- 
erations of finding the Ath element of a list and deleting a specific element are rehtively 
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efficient. It is also possible to build a list with given elements using this representation. 
We also briefly consider the operation of inserting a single new element. 

A list may be represented by a binary tree as illustrated in Figure 5.4.1. Figure 
5.4.1a shows a list in the usual linked format, while Figure 5.4, 1b and c show two binary 


I W 


(b) 


(c) 


Figure 5.4.1 List and two corresponding binary trees. 
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tree representations of the list. Elements of the original list are represented by leaves of 
the tree (shown as squares in the figure), whereas nonleaf nodes of the tree (shown as 
circles in the figure).are present as part of the internal tree structure. Associated with 
each leaf node are the contents of the corresponding list element. Associated with each 
nonleaf node is a count representing the number of leaves in the node’s left subtree. 

(Although this count can be computed from the tree structure, it is maintained as a data 
element to avoid recomputing its value each time that it is needed.) The elements of 
the list in their original sequence are assigned to the leaves of the tree in the inorder 
sequence of the leaves. Note from Figure 5.4.1 that several binary trees can represent 
the same list. 


Finding the kth Element 


` To justify using so many extra tree nodes to represent a list, we present an algo- 
` rithm to find the kth element of a list represented by a tree. Let tree point to the root of 
the tree, and let /count(p) represent the count associated with the nonleaf node pointed 
to by p [/count(p) is the number of leaves in the tree rooted at node(left(p))). The fol- 
lowing algorithm sets the variable find to point to the leaf containing the kth element of 
the list. Š 
- The algorithm maintains a variable r containing the number of list elements re- 
maining to be counted. At the beginning of the algorithm r is initialized to k. At each 
nonleaf node(p), the algorithm determines from the values of r and Icount(p) whether 
the kth element is located in the left or right subtree. If the leaf is in the left-sub- 
tree, the algorithm proceeds directly to that subtree. If the desired leaf is in the right 
subtree, the algorithm proceeds to that subtree after reducing the value of r by the 


value of Icount(p). k is assumed to be less than or r equal to the number of elements in 
the list. 


r= k; 
`p = tree; 
while (p is not a leaf node) 
if (r <= 1count(p)) 
p= left(p); 7 
else { 
-f-= Icount(p); 
p= right(p); + 
} /* end if */ : „i 
find = p; 


Figure -5.4.2a illustrates finding the fifth element of a list in the tree of Figure 
5.4.1b, and Figure 5.4.2b illustrates finding the eighth element in the tree of Figure 
5.4.1c. The dashed line represents the path taken by the algorithm down the tree to the 
appropriate leaf. We indicate the valuz of r (the remaining number of elements to be 
counted) next to each node encountered by the algorithm. 

The number of tree nodes examined in finding the kth list element is less than or 
equal to | more than the depth of the tree (the longest path in the tree from the root to 
. a leaf ). Thus four nodes are examined in Figure 5.4.2a in finding the fifth element of the 
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Figure 5:4.2 Finding the nth element of a tree-represented list. 


list, and also in Figure 5.4.2b in finding the eighth element. If a list is represented as 
a linked structure, four nodes are accessed in finding the fifth element of the list [that 
is, the operation p = next(p) is performed four times] and seven nodes are accessed in 
finding the eighth element. 

Although this is not a very impressive saving, consider a list with 1000 elements. 
A binary tree of depth 10 is sufficient to represent such a list. since logs 1000 i is less 
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than 10. Thus, finding the kth element (regardless of whether k was 3, 253, 708, or 
999) using such a binary tree would require examining no more than 11 nodes. Since 
the number of leaves of a binary tree increases as 24, where d is the depth of the tree, 
such a tree represents a relatively efficient data structure for finding the kth element of 
a list. If an almost complete tree is used, the kth element of an n-element list can be 
found in at most logan + 1 node accesses, whereas k accesses would be required if a 
linear linked list were used. 


Deleting an Element 


How can an element be deleted from a list represented by a tree? The deletion. 
itself is relatively easy. It involves only resetting a left or right pointer in the father of 
the deleted leaf di to null. However, to enable subsequent accesses, the counts in all 
ancestors of d! may have to be modified. The modification consists of reducing count 
by 1 in each node nd of which di was a left descendant, since the number of leaves in 
the left subtree of nd is | fewer. At the same time, if the brother of di is a leaf, it can 
be moved up the tree to take the place of its father. We can then move that node up 
even further if it has no brother in its new position. This may reduce the depth of the 
resulting tree, making subsequent accesses slightly more efficient. : f 

We may therefore present an algorithm to delete a leaf pointed to by p from a tree 
(and thus a element from a list) as follows. (The line numbers at the left are for future 
reference.) i 


if (p == tree) { 
tree = null; 
free node(p); 


else { 
f = father(p); 
/* remove node(p) and set b'to point to its brother */ 
if (p == left(f)) { ` 
9 left(f) = nul); 
10 b = right(f); 
11 --Icount(f); 
} 


133 else{ 

14 right(f) = null; 

15 b= left(f); 

16 . } /* end if */ 

17 if (node(b) is a leaf) { 


oe 


18 /* move the contents of node(b) up to its */ 
19° vA father and free node(b) */ 
20 info(f) = info(b); 

21 Teft(f) = null; 

22 right(f) = nul); 

23 Tcount(f) = 0; . ` 

24 free node(b); 


25 } /* end if */ 
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26 free node(p); . 
27  /* climb up the tree */ 


28 qf; 
29 while (q != tree) { 
30 f = father(q); 
31 if (q == left(f)) { 
32 /* the deleted leaf was a left descendant */ 
IS of node(f) */ 
33 --Icount(f); 
34 b = right(f); 
35 } ; 
- 36 else 
37 b = left(f); Y: 
38 /* node(b) is the brother of node(q) */ 
39 if (b == null && node(q) is 4 Jeaf){ 
40 /* move up the contents of node(q) */ 
41 /* to its father and free node(q) */ 
42 info(f) = info(q); 
43  Jeft(f) = null; 
44 right(f) = null; 
45 lċount(f) = 0; 
46 free node(q); 
47 } /* end if */ 
48 q= f; 


49  } /* end while */ 
50 } /* end else */ 


Figure 5.4.3 illustrates the results of this algorithm for a tree in which the nodes 
C. D, and-B are deleted in that order. Make sure that you follow the actions of the 
algorithm on these examples. Note that the algorithm maintains a 0 count in leaf nodes 
for consistency, although the count is not-required for such nodes. Note also that the 
algorithm never moves up a nonleaf node even if this could be done. (For example, 
the father of A and B in-Figure 5.4.3b has not been moved up.):We can easily modify 
- the algorithm to do this (the modification is left to the reader) but have not done so for 

reasons that will become apparent shortly. 

This deletion algorithm involves inspection of up to two nodes (the ancestor of 
the node being deleted and that ancestor's brother) at each level. Thus, the operation 
of deleting the kth element of a list represented by a tree (which involves finding the 
element and then deleting it) requires a number of node accesses approximately equal to 
three times the tree depth. Although deletion from a linked list requires accesses to only 
three nodes (the node preceding and following the deleted node as well as the deleted 
node), deleting the kth element requires a total of k + 2 accesses (k — 1 of which are-to 
logate the node preceding the kth). For large lists, therefore. the tree representation is 
more efficient. EA 

Similarly we can compare favorably the efficiency of tree-represented lists with 
array-represented lists. If an n-element list is maintained in the first» elements of an ar- 
ray. finding the Ath element involves only a single array access, but deleting it requires 
shifting the n — k elements that had followed the deleted element. If gaps are allowed in 
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Figure 5.4.3 Deletion algorithm. 
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the array so that deletion can be implemented efficiently (by setting d flag in the array 
position of the deleted element without shitting any‘subsequent elements), finding the 
` kth element requires at least k array. accesses- The reason is that it is no longer possible 
to know the array position of the kth element in the list, since gaps may exist among the 
elements in the array. [We should note, howe ver, that if the order of the elements in the 
list is irrelevant, the kth element in an array can be deleted efficiently by overwriting 
it with the element in position n (the last element) and adjusting the count ton — H. 
However, it is unlikely that we would want to delete the kth clement froma list in Which 
the order is “irrelevant, since there would then be no significance in the kth clement over 
any of the others. | 

Inserting a new kth element into a tree-represented list [between the (X — i)st 
and the previous Ath] is also a relatively efficient operation. The insertion consists of 
locating the Ath element, replacing it with a new nonleaf that has a leaf containing 
the new element as its left:son and a leaf containing the old Ath element as its right 
son, and adjusting appropiriate counts among its ancestors. We leave the details to the 
reader. (However, repeatedly adding a mew kth clement by this method causes the tree 
to become highly unbalanced, since the branch containing the kth element becomes dis- 
proportionately long comipared with the other branches. This means that the efficiency 
of finding the Ath element is not as grezit as it would be in a balanced tree in which all 
paths are approximately the same length. The reader is encouraged to find a “balanc- 
ing” strategy to alleviate this probsem. Despite this problem, if insertions into the tree 
are made randomly. so that it is equally likely for an element to be inserted at any given 


position, the resulting tee remains fairly balanced and finding the kth element remains 
efficient.) 


implementing Tree-Represented Lists in C - 


The C implementations of the search and deletion algorithms are straightforward 
using the linked representation of binary trees. However, such a representation requires 
info, count, father, left, and right fields for each tree node, whereas a list node requires 
only info and next fields. Coupled with the fact that the tree representation requires ap- 
proximately twice as many nodes as a linked list, this space requirement may make the 
tree representation impractical. We could. of course. utilize external nodes containing 
only an info field (and perhaps a father field) for the leaves, and internal nodes con- 
taining fcount, father, left, and right fields for the nonleaves. We do noi pursue that 
possibility here. 

Under the sequential representation of a binary tree. the space requirements are 
not nearly so great. If we assume that no insertions are required once the tree is con- 
structed and that the initial list size is known, we can set aside an array t6 hold an almost 
complete strictly binary tree representation of the list. Under that representation farien 
left-and right fields are unnecessary. As we shall'soon show. it is always possible to 
construct an almost complete binary tree representation of a list. oid 

Once the tree has been constructed. the only fields required are info. counr, and a 
field used to indicate whether or not an array element represents an existing or a deleted 
tree node. Also. as we have noted before. /count is only required for nonleaf nodes of the 
tree. so that a structure could actually be used with either the feo! field or the info tield. 
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depending on whether or not the node is a leaf. We leave this possibility as an exercise 
for the reader. It is also possible to eliminate the need for the used field at some expense 
to time efficiency (see Exercises 5.4.4 and 5.4.5). We assume the following definiuons 
and declarations (assume 100 elements in the list): 


#define MAXELTS 100 /* maximum humber of list elements */ 
#define NUMNODES 2*MAXELTS - 1 
#define BLANKS " `- į /* 20 blanks */ 
struct nodetype { 

char info[20]; 

int lcount; 

int used; 
} node [NUMNODES] ; 


A nonleaf node can be recognized by an info value equal to BLANKS. father(p), left(p), 
and right(p) can be implemented in the usual way as ( p= 1)/2,2* p+1,and2* p+2, 
respéctively. ; 

i A C routine to find the kth element follows. It uses the library routine strcmp, 
which returns 0 if two strings are equal. < 


int findelement(int k) 


int p, r; 
FaK á 
p = 0; 
while (strcmp(node[p].info, BLANKS) == 0) 
if (r <= node[p].]count) 
EE p= pt2 + 1; 
“r>a _node[p]. lcount; 
p= p* ; 
} /* end if */ 
return(p); 
} /* end findelement */ 


The C routine to delete the leaf pointed to by p using the sequential representation 
„is somewhat simpler than the corresponding algorithm presented in the foregoing. We 
can ignore all assignments of nuli (lines 2, 9, 14, 21, 22, 43 and 44), since pointers, 
are not used. We can also ignore the assignments of 0 to an Icount field (lines 23 and 
45). since such:an assignment is part of the conversion of a nonleaf to a leaf, and in 
our C representation the /count field in leaf nodes is unused. A node can be recognized 
as a leaf (lines 17*and 39) by a nonblank info value, and the pointer b as null (line 
39) by a FALSE value for node|b}.used. Freeing a node (lines 3, 26, and 46) is accom- 
plished by setting its used field to FALSE. The routine uses the library routine strepy(s,1),. 
which assigns string z to string s. and the routine strcmp to compare two strings for 
equality. 
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void delete(int p) 
t 
int b, f, q; 


if (p == 0) ; ' 
node[p].used = FALSE; ` /* Algorithm lines 1-4. */ 
ae { 
= (p-1) / 2; /* Algorithm Tine 6 */ 
# (p%2 != 0) { /* Algorithm line 8 “ 
b = 2*f + 2; 
--node[f] . Icount; 
} 
else 
b = 2*f +1; 
if (strcmp(node[b].info, BLANKS) != 0) { 
/* Algorithm lines 17-25 */ 
strcpy(node[f].info, node[b]. info); 
node[b].used = FALSE; 
} /* end if */ 


node[p].used = FALSE; /* Algorithm line 26 */ 
q = f; /* Algorithm line 28 */ 
while (q != 0) { 
f = (q-1) / 2; /* Algorithm Tine 30 ey 
if (q% 2 !=0) { '/* Algorithm Tine 31 ay 
--node[f]. count; 
b = 2*f + 2; 
} 
else 
b = 2*f +1; 


if (‘node[b}.used && stromp(node[{q].info, BLANKS) != 0) { 
/* Algorithm lines 39-47 */ 
strcpy(node[f].info, node[q].info); 
node[q].used = FALSE; 
} /* end if */ - 
q=f; 
23 } /* end while */ 
} /* end if */ 
} /* end delete */ 


Our use of the sequential representation explains the reason for not moving a nonleaf 
without a brother further up in a tree during deletion. Under the sequential represen- 
tation, such a moving-up process would involve copying the contents of all nodes in 
the subtree within the array, whereas it involves modifying oP a single pointer if the 
linked representation is used. 


Constructing a Tree-Represented List 
$ { 

We now return to the claim that, given a list of n elements, it is:possible to con- 

struct an almost complete strictly binary tree representing the list. We have already seen 
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in Section 5.) thatitis possible to construct an almost complete strictly binary tree with 
n leaves und 2 *n — 1 nodes, The leaves of such a tree occupy nodes numbered n — | 
through 2 * n — 2. If dis the smallest integer such that 2% is greater or equal to n (that 
is, if d equals the smallest integer greater than or equal to logan), d equals the depth of 
the tree. The number assigned to the first node on the bottom level of the tree is 2% — 1}, 
The first elements of the list are assigned to nodes numbered 24 — } through 2 * n — 2, 
and the remainder (if any) to nodes numbered n — | through 2° — 2. In constructing a 
tree representing a list with n elements, we can assign elements to the info fields of tree 
leaves in this sequence and assign a blank string to the info fields of the nonleaf nodes, 
numbered 0 through'n — 2. It is also a simple matter to initialize the used field to true 
in al] nodes numbered 0 to 2* n — 2. 

Initializing the values of the Icount array is more difficult. Two methods can be 
used: one involving more time and a second involving more space. In the first method, 
all ¿count fields are initialized to 0, Then the tree is climbed from each leaf to the tree 
root in turn. Each time a node is reached from its left son, 1 is added to its Ieount 
field. After this process is performed for each leaf. all count values have been properly 


assigned. The following routine uses this method to construct a tree.from a list of input 
data: * A i : f 


void buildtree(int n) 
{ ‘ i 


int d, f, i, p, power, size; ` r ; 


/* compute the tree depth d and the value of 24 */ 
d = 0; . 
power = 1; 
while (power < n) { 
+d; 
power *= 2; G 
} /* end while */ ; 
/* assign the elements of the list, initialize the used flags, */ 
/* and initialize the Ycount field to 0 in all nonleaves =f 
sizes 2*n - 1; ‘ 
for (i = power-1; i < sizel; i++) { 
scanf ("%d", &node[i]. info); : 
node[i].used = TRUE; 
+ /* end for */ tt 
for (i=n-1; i < power-1; 144) { {hry 
és scanf("%s", nodefi]. info); 
_ Wode[i].used = TRUE; 
} = /* end for */ 
for (i=0; i < n-1; i++) { 
node[i].used, = TRUE; 
node[i].lcount = 0; 
Strcpy(node[i}.info, BLANKS); 
} /ž end for */ p 


' 
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/* set the \count fields */ 
for (i=n-1; i < size; i++) { 
/* follow the path from each leaf to the root */ 
pei; 
while (p != 0) { 
f = (p-1)-/ 2; 
if (p% 2 != 0) 
‘++node[f].1count; 
p= f; 
} /* end while */ 
} /* end for */ 
} /* end buildtree */ 


“The second method uses an additional field, rcount, ineach node to hold the num- 

__ ber of leaves in the right subtree of each nonleaf node. This field as well as the [count 
field is set to 1 in each nonleaf that is the father of two leaves. If n is odd, so that there 
is a node (numbered (n — 3), 2) that is the father of a leaf and a nonleaf, Jcount in that 
node is set to 2 and rcount to 1. 

The algorithm then goes through the remaining array elements in reverse order, 
setting [count in each node to the sum of /count and rcount in the node’s left son, and 
rcount to the sum of /count and rcount in the node’s right son. We leave to the reader 
the C implementation of this technique. Note that rcount can be implemented as a local 
array in buildtree rather than as a field in every node, since its values are unused once 
the tree is built. 

This second method has the advantage that i it visits each nonleaf once to directly 
calculate its Icount (and rcount) value. The first method visits each nonleaf once for 
each of its leaf descendants, adding one to /count each time that the leaf is found to be a 
left descendant. To counterbalance this advantage, the second method requires an extra 

` recount field, whereas the first method needs no extra fields. 


The PEN Problem Revisited 


The Josephus problem of Section 4.5 is a perfect example of the utility of the 
binary tree representation of a list. In that problem it was necessary to repeatedly find 
the mth next element of a list and then delete that element. These are operations that 
can be performed efficiently in a tree-represented list. 

If size equals the number of elements currently in a list, the position of the mth 
node following the node in position k that has just been deleted is given by 1 +(A-—2+m) 
% size. (Here we assume that the first node in the list is considered in position l, not 
in position 0.) For example, if a list has five elements and the third element is deleted, 
and we wish to find the fourth element following the deleted element, size = 4. k = 3, 
and m = 4. Then k — 2 + m equals 5 and (k — 2 +m) % size is 1, so that the fourth 
_ element following the deleted element is in position 2. (After deleting element 3. we 
count elements 4, 5, 1, and 2.). We can therefore write a C function follower to find the 
mth node following a node in positión k that has just been deleted and to reset & to its 
position. The routine calls the routine findelement presented earlier. 
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int follower(int size, int m, int *pk) 
{ 


int j, d; 

j=k-2+m; 

*pk = (f % size) + 1; 

return(findelement(*pk)); 
} /* end follower */ 


The following C program implements the Josephus algorithm using a tree- 
represented list. The program inputs the number of people in a circle (7), an integer 
count (m), and the names of the people in the circle in order, beginning with the person, 
from whom the count starts. The people in the circle are counted in order and the person 
at whom the input count is reached leaves the circle. The count then begins again from 
1, starting at the next person. The program prints the order in which people leave the 
circle. Section 4.5 presented a program to do this using a circular list in which (n — 1) 
* m nodes are accessed once the initial list is constructed. The following algorithm 
accesses fewer than (n — 1) * logon nodes once the tree is built. 


/* definitions of MAXELTS, NUMNODES , BLANKS, */ = 
yf and nodetype go here x 


void buildtree(int); 
int follower(int, int, int zje 
void delete(int); 


main() 


int k, m, n, p, size; 
struct nodetype node[NUMNODES] ; 


scanf ("%d%d", &n, &m); ; 
buildtree(n); f 
k=n +1; /* initially we have "deleted" the (n+1)st person */ 
for (size = n; size > 2; --sizże) { 
/* repeat until one person is left */ 
p = follower(size, m, &k); 
printf("Xd\n", node[p]. info); 
delete(p); 
} /* end for */ 
** printf("%d", node[0). info); 
} /* end main */ 


EXERCISES 


5.4.1. Prove that the leftmost node at level n in an almost complete strictly binary tree is as- 
signed the number 2”. 
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5.4.2. Prove that the extension (see Exercise 5.3.5) of an almost complete binary tree is almost 
complete. 


5.4.3. For what values of n and m is the solution to the Josephus problem given in this section 


faster in execution than the solution given in Section 4.5? Why is this so? 

5.4.4. Explain how we can eliminate the need for a used field if we elect not to move up a 
newly created leaf with no brother during deletion. 

5.4.5. Explain how we can eliminate the need for a used field if we set count to ~1 inanonleaf 
that is converted to a-leaf node and reset info to blanks in a deleted node. 

5.4.6. Write a C routine buildtree in which each node îs visited only once by using an rcount 
array as described in the text. 

5.4.7. Show how to represent a linked list as an almost complete binary tree in which each list 


element is represented by one tree node. Write a C function to return a pointer to the kth 
element of such a list. 


5.5 TREES AND THEIR APPLICATIONS 


In this section we consider general trees and their representations. We also investigate 
some of their uses in problem solving. 

A tree is a finite nonempty set of elements in which one element is called the root 
and the remaining elements are partitioned into m > = 0 disjoint subsets, each of which 
is itself a tree. Each element in a tree is called a node of the tree. 

Figure 5.5.1 illustrates some trees. Each node may be the root of a tree with zero or 
more subtrees. A node with no subtrees is a leaf. We use the terms father, son, brother, 
ancestor, descendant, level, and depth in the same sense that we used them for binary 
trees. We also define the degree of a node in a tree as the number of its sons. Thus in 
Figure 5.5.la, node C has degree 0 (and is therefore a leaf), node D has degree 1, node 
B has degree 2, and node A has degree 3. There is no upper limit on the degree of. a 
node. 

Let us compare the trees of Figure 5.5. la and c. They are equivalent as trees. Each 
has A as its root and three subtrees. One of those subtrees has root C with no subtrees, 
another has root D with a single subtree rooted at G, and the third has root B with two 
subtrees rooted at E and F. The only difference between the two illustrations is the order 
in which the subtrees are arranged. The definition of a tree makes no distinction among 
subtrees of a general tree, unlike a binary tree, in which a distinction is made between 
the left and right subtrees. 

An ordered tree is defined as a tree in which the subtrees of each node form 
an ordered set. In an ordered tree we may speak of the first, second. or last son of a 
. particular node. The first son of a node in an ordered tree is often called the oldest 
son of that node, and the last son is called the youngest. Although the treés of Figures 
5.5.1a and c are equivalent as unordered trees, they are different as ordered trees. In the 
remainder of this chapter we use the word “tree™ to refer to “ordered tree.” A forest is 
an ordered set of ordered trees. , 

The question arises whether a binary tree is a tree. Every binary tree except for 
the empty binary tree is indeed a tree. However, not every tree is binury. A tree node 
may have more than two sons. whereas a binary tree node may not. Even a tree whose — 
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(c) 
Figure 5.5.1 Examples of trees. 
nodes have at most two sons is not necessarily a binary tree. This is because an only 
son in a general tree is not designated as being a “left” or a “right” son, whereas in a 
binary tree, every son must be either a “left” son or a “right” son. In fact. although a 


nonempty binary tree is a tree, the designations of left and right have no meaning within 
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the context of a tree (except perhaps to order the two subtrees of those nodes with two 
sons). A nonempty binary tree is a tree each of whose nodes has a maximum of two 
subtrees which have the added designation of “left” or “right.” 


C Representations of Trees 


How can an ordered tree be represented in C? Two alternatives immediately come 
to mind: an array of tree nodes may be declared or a dynamic variable may be allocated 
for each node created. However, what should the structure of each individual node be? 
In the representation of a binary tree, each node contains an information field and two 
pointers to its two sons. But how many pointers should a tree node contain? The number 


of sons of a node is variable and may be as large or as small as desired. If we arbitrarily 
declare : 


#define MAXSONS 20 


struct treenode { 
int info; 
Struct treenode *father; 
Struct treenode *sons[MAXSONS]; 


we are restricting the number of sons a node may have to a maximum of 20. Although 
in most cases this will be sufficient, it is sometimes necessary to create dynamically a 
node with 21 or 100 sons. Far worse than this remote possibility is the fact that twenty 
units of storage are reserved for each node in the tree even though a node may actually 
` have only 1 or 2 (or even 0) sons. This is a tremendous waste of space. 
One alternative is to link all the sons of a node together in a linear list. Thus the 
set of available nodes (using the array implementation) might be declared as follows: 


#define MAXNODES 500 


struct treenode { 
int info; 
int father; 
int son; 
int next; 
}; 
struct treenode node[MAXNODES]; 


node[p].son points to the oldest son of node[p}, and node[p].next points to the next 
younger brother of node[p]. 


Alternatively, a node may be declared as a dynamic variable: 


struct treenode { 
int info; . 
struct treenode *father; 
struct treenode *son; 
struct treenode *next; 
J; 
typedef struct treenode *NODEPTR; 
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Figure 5.5.2 Tree representations. (See reference on page 309.) 


If all traversals are from a node to its sons, the father field may be omitted. Figuie 5.5.2 
illustrates the representations of the trees of Figure 5.5.1 under these methods if no 
father field is needed. 

Even if it is necessary to access the father of a node, the father field can be omitted 
by placing a pointer to the father in the next field of the youngest son instead of leaving 
it null. An additional logical field could then be used to indicate whether the next field 
points to a “real” next son or to the father. Alternatively (in the array of nodes implemen- 
tation), the contents of the next field can contain negative as well as positive indices. A 
negative value would indicate that the next field points to the node’s father rather than 
to its brother, and the absolute value of the next field yields the actual pointer. This is 
similar to the representation of threads in binary trees. Of course, in either of these two 
latter methods, accessing the father of an arbitrary node would require a traversal of the 
list of its younger sons. 

If we think of son as corresponding to the left pointer of a binary tree node and 
next as corresponding to its right pointer, this method actually represents a general 
ordered tree by a binary tree. We may picture this binary tree as the original tree tilted 
45 degrees with all father-son links removed except for those between a node and its 
oldest son, and with links added between each node and its next younger brother. Figure 
5.5.3 illustrates the binary trees corresponding to the trees of Figure 5.5.1. 

In fact, a binary tree may be used to represent an entire forest, since the next 
pointer in the root of a tree can be used to point to the next tree of the forest. Figure 
5.5.4 illustrates a forest and its corresponding binary tree. 


Tree Traversals 

The traversal methods for binary trees induce traversal methods for forests. The 
preorder, inorder, or postorder traversals of a forest may be defined as the preorder, 
inorder, or postorder traversals of its corresponding binary tree. If a forest is represented 


as a set of dynamic variable nodes with son and next pointers as previously, a C routine 
to print the contents of its nodes in inorder may be:written as follows: 


void intrav(NODEPTR p) 
{ 


if (p != NULL) { 
intrav(p->son); 
printf("Xd\n", p->info); 
intrav(p->next) ; 
} /* end if */ 
} /* end intrav */ 


Routines for preorder and postorder traversals are similar. 
These traversals of a forest may also be defined directly as follows: 


PREORDER 
1. Visit the root of the first tree in the forest. 
2. Traverse in preorder the forest formed by the subtrees of the first tree. if any. 
3. Traverse in preorder the forest formed by the remaining trees in the forest, if any. 
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i Figure 5.5.3 Binary trees corresponding to trees of Figure 5.5.1. 
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Figure 5.5.4 Forest and its corresponding binary tree, 
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INORDER 


1. Traverse in inorder the forest famed by the subtrees of the first tree in the forest, 
if any. 


2. Visit the root of the first tree. 
3. Traverse in inorder the forest formed by the remaining trees in the forest, if any. 
POSTORDER 


1. Traverse in postorder the forest formed by the subtrees of the first tree in the forest, 
if any. 

2. Traverse in postorder the forest formed by the remaining trees in the forest, if any. 

3. Visit the root of the first tree in the forest. 


The nodes of the forest in Figure 5.5.4 a may be listed in preorder as ABCDE- . 
FGHIJKLMPRQNO, in inorder as BDEFCAIJKHGRPQMNOL and in postorder as 
FEDCBKJIHRQPONMLGA. Let us call a traversal of a binary tree a binary traver- 
sal, and a traversal of an ordered general tree a general traversal. 


General Expressions as Trees 


An ordered tree may be used to represent a general expression in much the same 

‘way that a binary tree may be used to represent a binary expression. Since a node may 

have any number of sons, nonleaf nodes need not represent only binary operators but 
` can represent operators with any number of operands. Figure 5.5.5 illustrates two ex- 
pressions and their tree representations. The symbol “%” is used to represent unary 
negation to avoid confusing it with binary subtraction that is represented by a minus 
sign. A function reference such as f(g,h,i,j) is viewed as the operator f applied to the 
operands g,h,i, and j. l 

A general traversal of the trees of Figure 5.5.5 in preorder results in the strings 
*% + AB -— +C log +D!EFGHIJ and q + AB sinC *X + YZ, respectively. 
These are the prefix versions of those two expressions. Thus we see that preorder gen- 
eral traversal of an expression tree produces its prefix expression. Inorder general 
traversal yields the respective strings AB + % CDE! + log + GHIJF — * and 
AB + C sin XYZ + *q, which are the postfix.versions of the two expressions.’ 

The fact that an inorder general traversal yields a postfix expression might be 
surprising at first glance. However, the reason for it becomes clear upon examination 
of the transformation that takes place when a general ordered tree is represented by a 
binary tree. Consider an ordered tree in which each node has zero or two sons. Such a 
tree is shown in Figure 5.5.6a, and its binary tree equivalent is shown in Figure 5.5.6b. 
Traversing the binary tree of Figure 5.5.6b is the same as traversing the ordered tree of 
Figure 5.5.6a. However, a tree such as the one in Figure 5.5.6a may be considered as a 
binary tree in its own right. rather than as an ordered tree. Thus it is possible to perform 
a binary traversal (rather than a generai traversal) directly on the tree of Figure 5.5.6a. 
Beneath that figure-are the binary traversals of that tree, and opposite Figure 5.5.6b are 
the binary traversals of the tree in that figure, which are the same as the traversals of 
the tree of Figure 5.5.6a if it is considered as an ordered tree. , 

Note that the preorder traversats of the two binary trees are the same. Thus if a 
preorder traversal on a binary tree representing a binary expression yields.the prefix of 


Trees Chap. 5 
312 


(a) -(A + B)» (C + log(D + E!)-f(G, H,1,J)) 


` (b) Q(A +B, sim(C), X e (Y + ZY 


Figure 5.5.5 Tree representation of an arithmetic expression: 
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Preorder: + *AB+*CDE 
inorder: A*B+C*#D+E 
Postorder: AB * CD * E + + 


Preorder: pë AB + * CDE 
Inorder: AB*CD*E ++ 
Postorder: BADCE * + * + 


(b) 


Figure 5.5.6 


the expression, that traversal on an ordered tree representing a general expression that 
happens to have only binary operators yields prefix as well. However, the postorder 
traversals of the two binary trees are not the same. Instead, the inorder binary traversal 
of the second (which is the same as the inorder general traversal of the first if it is 
considered as an ordered tree) is the same as the postorder binary traversal of the first. 
Thus the inorder general traversal of an ordered tree representing a binary expression 
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is equivalent to the postorder binary traversal of the binary tree representing that ex- 
pression, which yields postfix. i 


Evaluating an Expression Tree 


Suppose that it is desired to evaluate an expression whose operands are all nu- 
merical constants. Such an expression can be represented in C by a tree each of whose 
nodes is declared by- 


#define OPERATOR 0 
_ #define OPERAND 1 3 - 
„Struct treenode {` 
short int utype; /* OPERATOR or OPERANDO */ 
union { 
char operator[10]; 
float val; 
} info; s 
struct treenode *son; 
Struct treenode *next; . 


}; 
‘typedef treenode *NODEPTR; 


Ti:e son and next pointers are used to link together the nodes of a tree as previously 
illustrated. Since a node may contain information that may be either a number (operand) 
or a character string (operator), the information portion of the node is a union component 
of the structure. 

We wish to write a C function evaltree(p) that accepts a pointer to such a tree 
and returns the value of the expression represented by that tree. The routine evalbintree 
presented in Section 5.2 performs a similar function for binary expressions. evalbintree 
utilizes a function oper, which accepts an operator symbol and two numerical operands 
and returns the numerical result of applying the operator to the operands. However, in 
the case of a general expression we cannot use such a function, since the number of 
operands (and hence the number of arguments) varies with the operator. We therefore 
introduce a new function, apply(p), which accepts a pointer to an expression tree that 
contains a single operator and its numerical operands and returns the result of applying 
the operator to its operands. For example, the result of calling the function apply with 
parameter p pointing to the tree in Figure 5.5.7 is 24. If the root of the tree that is 
passed to evaltree represents an operator, each of its subtrees is replaced by tree nodes 
representing the numerical results of their evaluation so that the function apply may be 
called. As the expression is evaluated, the tree nodes representing operands are freed 
and operator nodes are converted to operand nodes. z l 

We present a recursive procedure replace that accepts a pointer to an expression 
tree and replaces the tree with a tree node containing the numerical result of the expres- 
sion’s evaluation. i 

void replace(NODEPTR p) 

{ 


float value; 
NODEPTR q, r; 
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if (p->utype == OPERATOR) { 

/* the tree has an operator */ 
lig as its root */ 
q = p->son; 

while (q != NULL) { 

' /* replace each of its subtrees 
is by operands 
replace(q); 

q = q->next; 
} /* end while */ $ 
/* apply the operator in the root tọ 
/* the operands in the subtrees 
value = apply(p); 
/* replace the operator by the result 
P->utype = OPERAND; 
p->val = value; 
lid fre@"911 the. subtrees 
q = p->son; 
~~p->son = NULL; 
while (q != NULL) { 
r=qi 
q = Q->next; 
free(r); 
} /* end while */ 
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Figure 5.5.7 Expression tree. 
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} /* end if */ 
} /* end replace */ 


The function evaltree may now be written as follows: 


float evaltree(NODEPTR p) 


{ 
NODEPTR q; 


replace(p); 
return(p->val); 
free(p); 
} /* end evaltree */ 
After calling evaltree(p) the tree is destroyed and the value of p is meaningless. 
This is a case of a dangling pointer in.which a pointer variable contains the address of 
a variable that has been freed. C programmers who use dynamic variables should be 
careful to recognize such pointers and not to use them subsequently. 


Constructing a Tree 


A number of operations are frequently used in constructing a tree. We now present 
some of these operations and their C implementations. In the C representation, we as- 
sume that father pointers are not needed, so that the father field is not used and the next 
pointer in the youngest node is null. The routines would be slightly more complex and 
less efficient if this were not the case. 

The first operation that we examine is setsons. This operation accepts a pointer 
to a tree node with no sons and a linear list of nodes linked together through the next 
field. setsons establishes the nodes in the list as the sons of the node in the tree. The 
C routine to implement this operation is straightforward (we use the dynamic storage 
implementation): ‘ 


void setsons(NODEPTR p, NODEPTR list) 
{ ' 


/* p points to a tree node, list to a list */ 
/* of nodes linked together through their */ 
/* next fields */ 
if (p == NULL) { 
printf("invalid insertion\n"); 
exit(1); 
} /* end if */ 
if (p->son != NULL) { 
printf("invalid insertion\n"); i 
exit(1); ( 
} /* end if */ 
p->son = list; 
} /* end setsons */ 
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Another common operation is addson(p,x),,in which p points to a node in a tree and 
it is desired to add a node containing x as the youngest son of node(p). The C routine 
to implement addson is as follows. The routine calls the auxiliary function getnode, 
which allocates a node and returns a pointer to it. 


void addson(NODEPTR p, int x) 
{ ; 
NODEPTR q; 


if (p == NULL) { 
printf("void insertion\n"); 
exit(1); 
} /* end if */ ; ' 

« /* the pointer q traverses the list of sons £y 
/* of p. r is one node behind q */ 
r= NULL; ` 
q = p->son; 

. while (q != NULL) { 
req; 
q = Q->next; : 
} /* end while */ a 
/* At this point, r points to the youngest */ 
/* son of p, or is null if p has no sons */ 
q = getnode(); 
q->info = x; 
q->next = NULL; 
if (rF == NULL) /* p has no sons */ 
P->son = q; 
else 
r->next = q; 
} /* end addson */ 


Note that to add a new son to a node, the list of existing sons must be traversed. 
Since adding a son is a common operation, a representation is often used that makes this 
operation more efficient. Under this alternative representation, the list of sons is ordered 
from youngest to oldest rather than vice versa. Thus son(p) points to the youngest son 
of node(p), and next(p) points to its next older brother. Under this representation the 
routine addson may be written as follows: 


** void addson(NODEPTR p, int x) 
{ 
NODEPTR q; 


if (p == NULL) { 
printf("invalid insertion\n"); 
exit(1); . i 

} /* end if */ 
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q = getnode(); 
q->info = x; 
Q->next = p->son; 
P->son = q; 


} /* end addson */ 


EXERCISES 


5.5.1. 
5.5.2. 
5.5.3. 


5.5.4. 


5.5.5. 


5.5.6. 


5.5.7. 


5.5.8. 


5.5.9. 


How many trees exist with n nodes? 

How many trees exist with n nodes and maximum level m? 

Prove that if m pointer fields are set aside in each node of a general tree to point to a 
maximum of m sons, and if the number of nodes in the tree is n, the number of null son 
pointer fields is n * (m — 1) + 1. 


If a forest is represented by a binary tree as in the text, show that the number of null 
right links is 1 greater than the number of nonleaves of the forest. 

Define the breadth-first order of the nodes of a general tree as the root followed by all 
nodes on level 1, followed by all nodes on level 2, and so on. Within each level, the 
nodes should be ordered so that children of the same father appear in the same order as 
they appear in the tree and, if n1 and n2 have different fathers, n1 appears before n2 if 
the father of m1 appears before the father of n2. Extend the definition to a forest. Write 
a C program to traverse a forest represented as a binary tree in breadth-first order. 
Consider the following method of transforming a general tree, gt, into a strictly binary 
tree, bt. Each node of gr is represented by a leaf of br. If gt consists of a single node, b 
consists of a single node. Otherwise br consists of a new root node and a left subtree 
It, and a right subtree, rt. It is the strictly binary tree formed recursively from the oldes 


‘subtree of gt, and rt is the strictly binary tree formed recursively from gt without its 


oldest subtree. Write a C routine to convert a general tree into a Strictly binary tree. 


Write a C function compute that accepts a pointer to a tree representing an expression 
with constant operands and returns the result of evaluating the expression without de- 
stroying the tree. 

Write a C program.to convert an infix expression into an expression tree. Assume that 
all nonbinary operators precede their operands. Let the input expression be represented 
as follows: an operand is represented by the character ‘N’ followed by a number, an 
operator by the character ‘T’ followed by a character representing the operator, and a 
function by the character ‘F’ followed by the name of the function. 


Consider the definitions of an expression, a term, and a factor given at the end of Sec- 
tion 3.2. Given a string of letters, plus signs, asterisks and parentheses that forms a valid 
expression, a parse tree can be formed for the string. Such a tree is illustrated in Figure 
5.5.8 for the string “(A + B) + (C + D)”: Each node in such a tree represents a substring 
and contains a letter (E for expression, T for term, F for factor, or S for symbol) and 
two integers. The first is the position within the input string where the substring repre- 
sented by that node begins. and the second is the length of the substring. (The substring 
represented by each node is shown below that node in the figure.) The leaves are all $ 
nodes and represent single symbols of the original input. The root of the tree must be 
an E node. The sons of any’non-S node N represent the substrings which make up the 
grammatical object represented by N. Write a C routine that accepts such a string and 
constructs a parse tree for it. 
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5.6 EXAMPLE: GAME TREES 


One application of trees is to game playing by computer. We illustrate this application 
by writing a C program to-determine the “best” move in tic-tac-toe from a given board 
position. 

Assume that there is a function evaluate that accepts a board position and an 
indication of a player (X or O) and returns a numerical value that represents how “good” 

` the position seems to be for that player (the larger the value returned by evaluate, the 
better the position). Of course, a winning position yields the largest possible valué, and 
a losing position yields the smallest. An example of such an evaluation function for tic- 
tac-toe is the number of rows, columns, and diagonals remaining open for one player 
minus the number remaining open for his or her opponent (except that the value 9 would 
be returned for a position that wins, and —9 for a position that loses). This function 
does not “look ahead” to consider any possible board positions that might result from 
the current position; it merely evaluates a static board position. 

Given a board position, the best next move ould be determined by considering 
all possible moves and resulting positions. The move selected should be the one that 
results in the board position with the highest evaluation. Such an analysis, however, 
does not necessarily yield the best move. Figure 5.6.1 illustrates a position and the five 
possible moves that X can make from that position. Applying the evaluation function 
Just described to the five resulting positions yields the values shown. Four moves yield 
the same maximum evaluation, although three of them are distinctly inferior to the 
fourth. (The fourth position yields a certain viciory for X, whereas the other three can 
be drawn by O.) In fact, the move that yields the smallest evaluation is as good or better 
than the moves that yield a higher evaluation. The static evaluation function, therefore, 
is not good enough to predict the outcome of the game. A better evaluation function 
could easily be produced for the game of tic-tac-toe (even if it were by the brute-force 
method of listing all positions and the appropriate response). but most games are too 
complex for static evaluators to determine the best response. 

Suppose that it were possible to look ahead several moves. Then the choice of 
a move could be improved considerably. Define the look ahead level as the number 
of future moves to be considered, Starting at any position, it is possible to construct 
a tree of the possible board positions that may result from each move. Such a tree is 
called a game tree. The game tree for the opening tic-tac-toe position with a look-ahead 
level of 2 is illustrated in Figure 5.6.2, (Actually other positions do exist, but because of 
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symmetry considerations these are effectively the same as the positions shown.) Note 
that the maximum level (called the depth) of the nodes in such a tree is equal to the 
look-ahead level. 

Let us designate the player who must move at the root's game position as plus 
and his or her opponent as minus. We attempt to find the best move for plus from the 
root’s game position. The remaining nodes of the tree may be designated as plus nodes 
or minus nodes, depending upon which player must move from that node’s position. 
Each node of Figure 5.6,2 is marked as a plus or as a minus node. 

Suppose that the game positions of all the sons of a plus node have been evaluated 
. for player plus. Then clearly, plus should choose the move that yields the maximum 
evaluation. Thus, the value of a plus node to player plus is the maximum of the values 
of its sons. On the other hand, once plus has moved, minus will select the move that 
yields the minimum evaluation for player plus. Thus the value of a minus node to player 
plus is the minimum of the values of its sons. 1 : 

Therefore to decide the best move for playe: us from’ the root, the Positions in 
the leaves must be evaluated for player plus using a static evaluation function. These 
values are then moved up the game tree by assigning to each plus node the maximum 
of its sons’ values and to each minus node the minimum of its sons’ values, on the 
assumption that minus will select the move that is worst for plus. The value assigned to 
each node of Figure 5.6.2 by this process is indicated in that figure immediately below 
the node. 

The move that plus should select, given the board position in the root node, is 
the one that maximizes its value. Thus the opening move for X should be. the middle 
square, as illustrated in Figure 5.6.2. Figure 5.6.3 illustrates the determination of O's 
best reply. Note that the designation of “plus” and “minus” depends on whose move is 
being calculated. Thus, in Figure 5.6.2 X is designated as plus, whereas in Figure 5.6.3 


Figure 5.6.3 Computing O's reply. 
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` 
O is designated as plus. In applying the static evaluation function to a board position, 
the value of the position to whichever player is designated as plus is computed. This 
method is called the minimax method, since, as the tree is climbed the maximum and 
minimum functions are applied alternately. 

The best move for a player from a given position may be determined by first 
constructing the game tree and applying a static evaluation function to the leaves. These 
values are then moved up the tree by applying the minimum and maximum at minus 
and plus nodes, respectively. Each node of the game tree must include a representation 
of the board and an indication of whether the node is a plus node or a minus node. Nodes 
may therefore be declared by 


struct nodetype { 
char board(3} (3); 
int turn; 
struct nodetype *son; 
struct nodetype *next; 
J; 


typedef struct nodetype *NODEPTR; 


p— >board{i)[j] has the value ‘X’, ‘O’, or * ’, depending on whether the square in 
row i and column j of that node is occupied by ‘either of the players or is unoccupżed. 
p— >turn has the value +1 or —1, depending on whether the node is a plus or minus 
node, respectively. The remaining two fields of a node are used to position the node 
within the tree. p— >son points to the oldest son of the node, and p— >next points to 
its next younger brother. We assume that the foregoing declaration is global, that an 
available list of nodes has been established, and that appropriate getnode and freenode 
routines have been written. 

The C function nextmove(brd, player, looklevel, newbrd) computes the best next 
move. brd is a 3-by-3 array representing the current board position, player is ‘X’ or 
‘O’, depending on whose move is being computed (note that in tic-tac-toe the value of 
player could be computed from brd, so that this parameter is not strictly necessary), 
and looklevel is the look-ahead level used in constructing the tree. newbrd is an output 
parameter that represents the best board position that can be achieved by player from 
Position brd. 

nextmove uses two auxiliary routines, buildtree and bestbranch. The function 
buildtree builds the game tree and returns a pointer to its root. The function bestbranch 
computes the value of two output parameters: best, which is a pointer to the tree node 
representing the best move, and value, which is the evaluation of that move using the 
minimax technique. 


void Pare eres brd{) (31, ‘int looklevel, char player, char newbrd[][3]) 
{ 


NODEPTR ptree, best; 
int i, j, value; 
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ptree = buildtree(brd, looklevel); 
bestbranch(ptree, player, &best, -Gvalue); 
for (i=0; i < 3; ++i) 
for (j=0; j < 3; +j) ' 
newbrd[i] [j] = best->board[i][j); 
} /* end nextmove */ 


The function buildtree returns a pointer to the root of a game tree. It uses the 
auxiliary function getnode, which allocates storage for a node and returns a pointer to 
it. It also uses a routine expand(p, plevel, depth), in which p is a pointer to a node in 
a game tree, plevel is its level, and depth is the depth of the game tree that is to be 
constructed. expand produces the subtree rooted at p to the proper depth. 


NODEPTR buildtree(char brd[}(3], int looklevel) 
{ 


NODEPTR ptree; 
int i, j; 


/* create the root of the tree and initialize it */ 
tree = getnode(); 
for (i=0; i < 3; ++i) 
for (j=0; j < 3; +j) 
ptree->board[i}[j] = brd{i}[j); 
/* the root is a plus node by definition */ 
ptree->turn = 1; 
ptree->son = NULL; 
ptree->next = NULL; 
/* create the rest of the game tree */ 
expand(ptree, 0, looklevel); 
return(ptree); f 
} /* end buildtree */ 


expand may be implemented by generating all board positions that may be ob- 
tained from the board position pointed to by p and establishing them as the sons of p 
in the game tree. expand then calls itself recursively using these sons as parameters 
until the desired depth is reached. expand uses an auxiliary function generate, which 
accepts a board position brd and returns a pointer to a list of nodes containing the board 
` Positions that can be obtained from brd. This list is linked together by the next field. 
We leave the coding of generate as an exercise for the reader. 


void expand(NODEPTR p, int plevel, int depth) 
{ ne 

NODEPTR q; 

if (plevel < depth) { 

‘ /* p is not at the maximum level */ 


q = generate(p->board); ; ! 
p->son = Q; 
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while (q != NULL) { š 
/* traverse the list of nodes */ 
if (p->turn == 1) 
q->turn = -1; 
else 
q->turn = 1; 
q->son = NULL; 
expand(q, plevel+1, depth); 
q = Q->next; 
} /* end while */ 
} /* end if */ 
} /* end expand */ 


Once the game tree has been created, bestbranch evaluates the nodes of the tree. 
When a pointer to a leaf is passed to bestbranch, it calls a function evaluate that stat- 
ically evaluates the board position of that leaf for the player whose move we are de- 
termining. The coding of evaluate is left as an exercise. When a pointer to a nonleaf 
is passed to bestbranch, the routine calls itself recursively on each of its sons and ther 
assighs the maximum of its sons’ values to the nonleaf if it is a plus node, and the 
minimum if it is a minus node. bestbranch also keeps track of which son yielded this 
minimum or maximum value. 

If p— >turn is —1, the node pointed to by p is a minus node and it is to be as-_ 
signed the minimum of the values assigned to its sons. If, however, p— >turn is +1, the 
node pointed to by p is a plus node and its value should be the maximum of the values 
assigned to the sons of the node. If min(x,y) is the minimum of x and y, and max(x,y) 
is their maximum, min(x,y) = —max(—x,— y) (you are invited to prove this as a triv- 
ial exercise). Thus, the correct maximum or minimum can be found as follows: in the 
case of a plus node, compute the maximum; in the case of a minus node, compute the 
maximum of the negatives of the values and then reverse the sign cf the result. These 
ideas are incorporated into bestbranch. The Output parameters *pbest and *pvalue are, 
respectively, a pointer to that son of the tree’s root that maximizes its value and the 
value of that son that has now been assigned to the root. 


void bestbranch(NODEPTR pnd, char player, NODEPTR *pbest, 
int *pvalue) 
{ 


NODEPTR p, pbest2; 
int val; 


if (pnd->son == NULL) { 
/* pnd is a leaf */ 
_ *pvalue = evaluate(pnd->board, player); 
“~*pbest = pnd; ' 
} 


else { . 

/* the node is not a leaf, traverse the list of sons bth 
p = pnd->son; 

bestbranch(p, player, pbest, pvalue); 
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*pbest = p; 
if (pnd.turn == -1) 
*pvalue = -*pvalue; 
p = p->next; 
while (p != NULL) { P 
‘bestbranch(p, player, &pbest2, aval); 
if. (pnd->turn == -1) * 
val = -val; >- 
if (val-> *pvalue) { 
*pvalue = val; 
*pbest =p; 
} /* {end if */ 
p = p->next; : 
}. /* end while */ 
if (pnd->turn == -1) 
*pvalue = -*pvalue; 
} /* end if */ 


l 


} /* end bestbranch */ 


Sn 


EXERCISES j 


5.6.1. 
5.6.2. 
5.6.3. 
5.6.4. 
5.6.5. 


5H4. 


5.6.7. 


Examine the routines of this section and determine whether all the parameters are ac- 
wally necessary. How would you revise the parameter lists? 3 
Write the C routines generate and evaluate as described in the text. 

Rewrite the programs of this and the preceding section under the implementation in 
which each tree node includes a field father containing a pointer to its father. Under 
which implementation are they more efficient? ! 
Write nonrecursive versions of the routines expand and bestbranch given in the text. 
Modify the routine bestbranch in the text so that the nodes of the tree are freed after 
they are no longer needed. 5 

Combine the processes of building the game tree and evaluating its nodes into a single 
process so that the entire game tree need not exist at any one time and nodes are freed 
when no longer necessary. : É ; 
Modify the program of the previous exercise so that if the evaluation of a minus node 
is greater than the minimum of the values of its father’s older brothers, the program 
does not bother expanding that minus node’s younger brothers, and if the evaluation of 
a plus node is less than the maximum of the values of its father’s older brothers, the 
program does not bother expanding that plus node’s younger brothers. This method is 
called the alpha-beta minimax method. Explain why it is correct. 

The game of kalah is played as follows: Two players each have seven holes, six of 


which are called pits and the seventh a kalah. These are arranged according to the 
following diagram. 


Player 1 


KPPPPPP 
PPPPPPK 


Player 2 
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5.6.9, 


5.6.10. 


5.6.11. 
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Initially there are $ix stones in each pit and no stones in either kalah, so that the opening 
Position looks like this: 3 


0666666 
6666660 


The players alternate turns, each turn ‘consisting of one or more moves. To makea move, 
a player chooses one of his or her nonempty pits. The stones are removed from that 
pit and are distributed counterclockwise into the pits and into that player's kalah (the 
opponent’s kalah is skipped), one stone per hole, until there are no stones remaining. For 


example, if player 1 moves first, a possible opening move might result in the following 
board position: 


1777770 
66666606 


If a player's last stone lands in his or her own kalah, the player gets another move. If 
the last stone lands in one of the player’s own pits that is empty, that stone and the 
stones in the opponent's pit directly opposite are removed and placed in the player's 
kalah. The game ends when either player has no stones remaining in his or her pits. At 
that point, all the stones in the opponent's pits are placed in the opponent’s kalah and 
the game ends, The player with the most stones in his or her kalah is the winner. © 
Write a program that accepts a kalah board Position and an indication of whose tum it 
is and produces that player's best move, 


How would you modify the ideas of the tic-tac-toe program to compute the best move 
in a game that contains an element of chance, such as backgammon? 


Why have computers been programmed to play perfect tic-tac-toe but not perfect chess 
or checkers? 

The game of nim is played as follows: Some number of sticks are placed in a pile. 
Two players alternate in removing either one or two sticks from the pile. The player to 
remove the last stick is the loser, Write a C function to determine the best move in nim. 


Trees . Chap.5 


Sorting and searching are among the most common ingredients of programming sys- 
tems. In the first section of this chapter we discuss some of the overall considerations in- 
volved in sorting. In the remainder of the chapter we discuss some of the more common 
sorting techniques and the advantages or disadvantages of one technique over another. 
In the next chapter we discuss searching’and some applications. 


6.1 GENERAL BACKGROUND 


The concept of an ordered set of elements is one that has considerable impact on our 
daily lives. Consider, for example, the process of finding a telephone number in a tele- 
phone directory. This process, called a search, is simplified considerably by the fact 
that the names in the directory are listed in alphabetical order. Consider the trouble you 
might have in altempting to locate a telephone number if the names were listed in the 
order in which the customers placed their phone orders with the telephone company. 
In such a case, the names might as well have been entered in random order, Since 
the entries are sorted in alphabetical rather than in chronological order, the process 
of searching is simplified. Or consider the case of someone searching for a book in a 
-library. Because the books are shelved in a specific order (Library of Congress, Dewey 
System, and so forth), each book is assigned a specifi¢ position relative to the others 
and can be retrieved in a reasonable amount of time (if it is there). Or consider a set 
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of numbers sorted sequentially in a computer’s memory. As we shall see in the next 
chapter, it is usually easier to find a particular element of such a set if the numbers are 
maintained in sorted order. In general, a set of items is kept Sorted in order to either 
produce a report (to simplify manual retrieval of information, as in a telephone book or 
a library shelf) or to make machine access to data more efficient. Ya 

We now present some basic terminology. A file of size n is a sequence of n items 
r{O], r[1], ..., rim — 1}. Each item in the file is called a record. (The terms file and 
record are not being used here as in C terminology to refer to specific data stryctures. 
Rather, they are being used in a more general sense.) A key, k{i], is associated with 
each record r{i]. The key is usually (but not always) a subfield of the entire record. 
The file is ‘said to be sorfed on the key if i < j implies that k[i] precedes k{j] in some 
ordering on the keys. In the example of the telephone book, the file consists of al! the 
entries in the book. Each entry is a record. The key upon which the file is sorted is the 
name field of the record. Each record also contains fields for an address and a telephone 
number: i T 

A sort can be classified as being internal if the records that it is sorting are in main 
memory, or external if some of the records that it is sorting are in auxiliary storage. We . 
restrict our attention to internal sorts. 

It is possible for two records in a file to have the same key. A sorting technique 
is called stable if for all records i and j such that kf i] equals k(j], if r[i] precedes r[j] in 
the original file, r[‘] precedes r[j] in the sorted file. That is, a stable sort keeps records 
with the same key in the same relative order that they were in before the sort. 

A sort takes_place either on the records themselves or on an auxiliary table of 
pointers. For example, consider Figure 6.1.1a, in which a file of five records is shown. 
If the file is sorted in increasing order on the numeric key shown, the resulting file is as 
shown in Figure 6.1.1b. In this case the actual records themselves have been sorted. 

Suppose, however, that the amount of data stored in each of the records in the - 
file of Figure 6.1.1a is so large that the overhead involved.in moving the actual data is 
prohibitive. In this case an auxiliary table of pointers may be used so that these pointers 
are moved instead of the actual data, as shown in Figure 6.1.2. (This is called sort-- 
ing by address.) The table in the center is the file, and the table at the left is the initial 
table of pointers. The entry in position j in the table of pointers points to record j. During 


Key Other fields 


Record 1 
Record 2 
Record 3 
Record 4 


Record § 


(a) Original file. (b) Sorted file. Figure 6.1.1 Sorting actual records. 
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Original * Sorted 
pointer . 


“ Figure 6.1.2 Sorting by using an auxiliary table of pointers. 


the sorting process, the entries in-the pointer table are adjusted sc that the final table ` 
is as shown at the right. Originally, the first pointer was to the first entry in the file; 
upon completion the first pointer is to the fourth entry in the table. Note that none of 
the original file entries are moved, In most of the programs in this chapter we illustrate 
techniques of sorting actual records. The extension of these techniques to sorting by 
address is straightforward and will be left as an exercise for the reader. (Actually, for 
the sake of simplicity, in the examples presented in this chapter we sort only the keys: 
we leave to the reader to modify the programs to sort:full records.) : 
Because of the relationship between sorting and searching, the first question to 
ask in any application is whether or not a file should be sorted. Sometimes there is 
less work involved in searching a set of elements for a particular one than to first sort 
the entire set and to then extract the desired element. On the other hand, if frequent 
use of the file is required for the purpose of retrieving specific elements, jt might be 
more efficient to sort the file. This is because the overhead of successive searches may 
far exceed the overhead involved in sorting the file once and subsequently retrieving 
elements from the sorted file. Thus it cannot be said that it is more efficient either to sort 
or not to sort. The programmer must make a decision based on individual circumstances. 
Once a decision to sort has been made, other decisions must be made, incliding what 
is to be sorted and what methods are to be used. There is no one sorting method that is 
universally superior to all others. The programmer must carefully examine the problem 
and the desired results before deciding these very important questions. : 


Efficiency Considerations 


As we shall see in this chapter, there are a great number of methods that can be 
used to sort a file. The programmer. must be aware of several interrelated and often 
conflicting efficiency considerations to make an intelligent choice about which sorting 
method is most appropriate to a particular problém. Three of the most important of 
these considerations include the length of time that must be spent by the programmer in 
coding a particular sorting program, the amount of machine time nécessary for running 
the program, and the amount of space necessary for the program. : 
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If a file is small, sophisticated sorting techniques designed to minimize space and 
time requirements are usually worse or only marginally better in achieving efficiencies 
than simpler, generally less efficient methods. Similarly, if a particular sorting program 
is to be run only once and there is sufficient machine time and space in which to run it, 
it would be ludicrous for a programmer to spend days investigating the best methods 
of obtaining the last ounce of efficiency. In such cases the amount of time that must be 
spent by the programmer is properly the overriding consideration in determining which 
sorting method to use. However, a strong word of caution must be inserted. Program- 
ming time is never a valid excuse for using: an incorrect program. A sort which is run 
only once may be able to afford the luxury of an inefficient technique, but it cannot 
afford an incorrect one. The presumably sorted data may be used in an application in 
which the assumption of ordered data is crucial. 

However, a programmer must be able to recognize the fact that a particular sort 

: is inefficient and must be able to justify its use in a particular situation. Too often, pro- 
grammers take the easy way out and code an inefficient sort, which is then incorporated 
into a larger system in which the sort is a key component. The designers and planners 
of the system are then surprised at the inadequacy of their creation. To maximize his 
or her own efficiency, a programmer must be knowledgeable of á wide range of sorting 
techniques and be cognizant of the advantages and disadvantages of each, so that when 
the need for a sort arises he or she can supply the one which is most appropriate for the 
particular situation. 

This brings us to the other two efficiency considerations: time and space. As in 
most computer applications, the programmer must often optimize one of these at the 
expense of the other. In considering the time necessary to sort a file of size n we do 
not concern ourselves with actual time units, as these will vary from one machine to 
another, from one program to another, and from one set of data to another. Rather, we 
are interested in the corresponding change in the amount of time required to sort a file 
induced by a change in the file size n. Let us see if we can make this concept more 
precise, We say that y is proportional to x if the relation between y and x is such that 
multiplying x by a constant multiplies y by that same constant. Thus if y is proportional 
to x, doubling x will double y, and multiplying x by 10 will multiply y by 10. Similarly, 
if y is proportional to x2, doubling x will multiply y by 4 and multiplying x by 10 will 
multiply: y by 100. 

Often we do not measure the time efficiency of a sort by the number of time units 
required but rather by the number of critical operations performed. Examples of such 
critical operations are key comparisons (that is, the comparisons of the keys of two 
records in the file to determine which is greater), movement of records or pointers to 
records, or interchanges of two records. The critical operations chosen are those that take 
the most time. For example, a key comparison may be a complex operation, especially 
if the keys themselves are long or the ordering among keys is nontrivial. Thus a key 
comparison requires much more time than say, a simple increment of an index variable 
in a for loop. Also, the number of simple operations required is usually Proportional to 
the number of key comparisons. For this reason, the number of key comparisons is a 
useful measure of.a sort’s time efficiency, 

There are two ways to determine the time requirements of a sort, neither of which 
yields results that are applicable to all cases. One method is to go through a sometimes 
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a+b) 

n a= 0.01? b = 10n +p ee? 

10 1 100 101 LOI 

50 25. 500 .:525 0.21 

100 100 1,000 1,100 0. 

500 2,500 5,000 7,500 0.03 

1,000 10,000 10,000 20,000 0.02 

5,000  _ 250,000 50,000 300,000 0.01 

10,000 1,000,000 100,000 1,100,000 0.01 

50,000 25,000,000 500,000" 25,500,000 0.01 

100,000 100,000,000 1,000,000 101,000,000 0.01 

500,000  2,500,000,000 5,000,000  2,505,000,000 0.01 
Figute 6.1.3 


intricate and involved mathematical analysis of various cases (for example, best case, 
worst case, average case). The result of this analysis is often a formula giving the aver- 
age time (or number of operations) required for a particular sort as a function of the file 
size n. (Actually the time requirements of a sort depend on factors other than the file 
size; however, we concern ourselves here only with the dependence on the file size.) 
Suppose that such a mathematical analysis on a particular sorting program results in 
the conclusion that the program takes 0.01n?+ 10n time units to execute. The first and 
fourth columns of Figure 6.1.3 show the time needed by the sort for various values of 
n. You will notice that for small values of n, the quantity 10n (third column of Figure 
6.1.3) overwhelms the quantity 0.01n2 (second column). This is because the difference 
between n? and n is small for small values of n and is more than compensated for by 
the difference between 10 and 0.01. Thus, for small values of n, an increase in n by a 
factor of 2 (for example, from 50 to 100) increases the time needed for sorting by ap- 
proximately that same factor of 2 (from 525 to 1100). Similarly, an increase in n by a 
factor of 5 (for example, from 10 to 50) increases the sorting time by approximately 5 
(from 101 to 525). i 
However, as n becomes larger, the difference between n? and n increases so 
quickly that it eventually more than compensates for the difference between 10 and 
0.01. Thus when n equals 1000 the two terms contribute equally to the amount of time 
“needed by the program. As n becomes even larger, the term 0.01n? overwhelms the 
term 10n and the contribution of the term 10n becomes almost insignificant. Thus. 
for large values of n, an increase in n by a factor of 2 (for example, from 50,000 to 
100,000) results in an increase in sorting time of approximately 4 (from 25.5 million 
to 101 million) and an increase in n by a factor of 5 (for example, from 10,000 to 
50,000) increases the sorting time by approximately a factor of 25 (from 1.1 million 
to 25.5 million). Indeed, as n becomes larger and larger, the sorting time becomes 
more closely proportional to n?, as is clearly illustrated by the last column of Fig- 
ure 6.1.3. Thus for large n the time required by the sort is almost proportional to ns 
Of course, for small values of n, the sort may exhibit drastically different behav- 
ior (as in Figure 6.1.3), a situation that must be taken into account in analyzing its 
efficiency. 
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O Notation : 


\ To capture the concept of one function becoming proportional to another as it 
grows, we introduce some terminology and a new notation. In the previous example, 
the function 0.01n?+ 10n is said to be “on the order of” the function n? because, as n 
becomes large, it becomes more nearly proportional to n?. 

To be precise, given two functions f(n) and g(n), we say that f(n) is on the or- 
der of g(n) or that f(n) is O(g(n)) if there exist positive integers a and b such that 
f(n) S a » g(n) for all n = b. For example, if f(n) = n?+ 100n and g(n) = rè, 
fín) is O(g(n)), since n2+ 100n is less than or equal to 2n? for all n greater than 
or equal to 100. In‘this case a equals 2/and b equals 100. This same fín) is also 
O(n*), since n?+ 100n is less than or equal to 2n? for all n greater than or equal 
to 8. Given a function f(n). there may be many functions g(n) such that f(n) is 
O(g(n)). = 

If f(n) is O(g(n)), “eventually” (that is, for n = b) fín) be¢omes permanently 
smaller or equal to some multipleof g(n). In a sense we are saying that f(n) is bounded 
by g(n) from above, or that f(n) is a “smaller” function than &(n). Another formal way 
of saying this is that fin) is asymptotically bounded by g(n). Yet another interpretation 
is that f(n) grows more slowly than g(n), since, Proportionately (that is, up to a factor 
of a), g(n) eventually becomes larger. 

It is easy to show that if f(n) is O(g(n)) and &(n) is O(h(n)), fin) is O(h(n)). For 
example,'n?+ 100n is O(n), and n? is O(n) (to see this, set a and b both equal to 1): 
consequently n?+ 100n is O(n? ). This is called the transitive property. 

Note that if f(n) is a constant function [that is, f(n) = c for all n), fin) is OC), 
since, setting a to c and b to 1, we have that c < c * ] forall n = 1. (In fact, the value 
of b or n is irrelevant, since a constant function’s value is independent of n.) 

* Itis also easy to show that the function c» n is O(n*) for any constants c and k. To 
see this, simply note that c * n is less than or equal to c * nk for any n = | (that is, set 
a = c and b = |). It is also obvious that 7 is O(n'*/) for any j = 0 (use. a = 1, 

= 1). We can also show that if f(n) and g(n) are both O(h(n)), the new function fin) + 
(n) is.also O(h(n)). All these facts together can be used to show that if f(n) is any poly- 
nomial whose leading power is k [that is, f(n) = cypenk + cyank-l yey Ck*n+ Cki, 
fin) is O(n*). Indeed, fin) is O(n'*4) for any j = 0. i f 

Although a function may be asymptotically bounded by many other functions {as 
for example, 10n? + 37n + 153 is OU), O(10n2), O(37n* + 10n) and O(0.05n°)], we 
usually look for an asymptotic bound that is a single term with a leading coefficient of 
l and that is as “close a fit” as possible. Thus we would say that 107? + 37n + 153 is 
OUT), although it is alsó asymptotically bounded by many other functions. Ideally, we 
would like to find a function (12) such that f(r) is O(g(n)) and &(n) is Of (n)). If fn) 
is a constant or a polynomial, this can always be done by using its highest term with a 
coefficientof 1. For more complex functions, however, it is not always possible to find 
such a tight fit. i 

An important function in the study of algorithm efficiency is the logarithm func- 
tion. Recall that log,, n is the value x such that m* equals n: m is called the base of 
the logarithm. Consider the functions logm nnd log, n. Let xm be logm n and xk be 
tog; n. Then `a 
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so that 


Taking the log,, of both sides, 


xm = log,,(k™) 
Now it can easily be shown than log. (x”) equals y * log. x for any x, y, and z, so that 
the last equation can be rewritten as (recall that xm = log,,,n) 
dog, n = xk * logn k 
or as (reca that xk = log, n) 


att log,, n = (log,, k) * log, n 
Thus log,» and log, n are constant multiples of each other. 

It is easy to show that if f(n) = c * g(n), where c is a constant, f(n) is O(g(n)) 
[indeed, we have already shown that this is true for the function f(n) = n*]. Thus log,, 
nis O(log, n) and log, n is O(log m n) for any m and k. Since each logarithm function 
is on the order of any other, we usually omit the base when speaking of functions of 
logarithmic order and say that all such functions are O(log n). 

The following facts establish an order hierarchy of functions: 
cis O(1) for any constant c. 

c is O(log n), but log n is not O(1). 

cxlog, nis O(log n) for any constants c, k. 

clog, n is O(n), but n.is not O(log 7). . 

c * n* is O(n*) for any constants c, k. 

c * n* is O(n'*/), but n**/ is not O(n*). 

c» n*log, nis O(n log n) for any constants c, k. 

c* n*log, n is O(n*), but n? is not O(n log n). 

c * nJ#log, n is O(n/log n) for any constants c, j, k. 

c-» ni*log, nis O(ns*!), but nJ*! is not O(n! log n): í 
c» n/*(log, n)! is O(n! (log n)') for any constants c, j, k; L. 

© c nix(log, n) i is Ost!) but n/*! is not O(n7 (log n’). 

cx n/s(logs n)! is Oly} (log n)'*!) but n’ (logy n)!*! is not O(n? (log nyy 
c*nt is O(d"), but d" is not O(n“) for any constants c and k, and d > 1. 


The hierarchy of functions established by these facts, with each function of lower order 
than the next, is c, log n. (log n)*, n, n(log n)*, n*, n*(log n)!, n*+!, and d". 

Functions that are O(n*) for some k are said to be of polynomial order, whereas 
functions that are O(d") for some d > | but not O(n‘) for any k are said to be of expo- 
nential order. 

The distinction between polynomial-order functions and exponential-order func- 
tions is extremely important. Even a small exponential-order function. such as 2", grows 
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far larger than any polynomial-order function, such as n‘, regardless of the size of k. 
As an illustration of the rapidity with which exponential-order functions grow, consider 
that 2! equals 1024 but that 2! (that is, 1024!°) is greater than the number formed by 
a 1 followed by 30 zeros. The smallest.k for which 10* exceeds 2° is 4, but the smallest 
k for which 100‘ exceeds 2! is 16. As n becomes larger, larger values of k are needed 
for né to keep up with 2". For any single k, 2” eventually becomes permanently larger 
than n“. 

Because of the incredible rate of growth of exponential-order functions, problems 
that require exponential-time algorithms for solution are considered to be intractable on 


current computing equipment; that is, such problems cannot be solved precisely except 
in the simplest cases. 


Efficiency of Sorting 


Using this concept of the order of a sort, we can compare various sorting tech- 
niques and classify them as being “good” or “bad” in general terms. One might hope to 
discoyer the “optimal” sort that is O(n) regardless of the contents or order of the input. 
Unfortunately, however, it can be shown that no such generally useful sort exists. Most 
of the classical sorts we shall consider have time requirements that range from O(n log n) 
to O(n’). In the former, multiplying the file size by 100 will multiply the sorting time by 
less than 200; in the latter, multiplying the file size by 100 multiplies the sorting time 
by a factor of 10,000. Figure 6.1.4 shows the comparison of n log n with n? for a range 
of values of n. It can be seen from the figure that for large n, as n increases, n? increases 
at a much more rapid rate than n log n. However, a sort should not be selected simply 
‘because it.is O(n log n). The relation of the file size n and the other terms constituting 
the actual sorting time must be known. In particular, terms which play an insignificant 
role for large values of n may play a very dominant role for small values of n. All these 
issues must be considered before an intelligent sort selection can be made. 

A second method of determining time requirements of a sorting technique is to 
actually run the program and measure its efficiency (either by measuring absolute time 
units or the number of operations performed). To use such results in measuring the ef- 
ficiency of a sort the test must be run on “many” sample files. Even when such statistics 


n nlogion n? 

1x10! 1.0 x 10! 1.0 x 10? 

5 x 10! 8.5 x 10! 2.5 x 103 

1x10? 2.0 x 10? 1.0 x 10* 

5x10? | 1.3 x 10° 2.5 x 108 

1x 10° 3.0 x 10° 1.0 x 10° 

5x10 1.8 x 104 2.5 x 10 

1x 10° 4.0 x 104 1.0 x 108 

5x10) 23x10 25x 10° 

1x 10° 5.0 x 10° 1.0 x 10'° 

5x 10° 28x10 25x10" 

1 x 10° 6.0 x 10° 1.0 x 10! 

5x 10° 3.3 x 107 2.5 x 104 Figure 6.1.4 Comparison of n log n 
1x10’ 7.0 x 10 1.0 x 10" and 7? for various values of n. 
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have been gathered, the application of that sort to a specific file may not yield results 
that follow the general pattern. Peculiar attributes of the file in question may make the 
sorting speed deviate significantly. In the sorts of the subsequent sections we shall give 
an intuitive explanation of why a particular sort is classified as O(n?) or O(n tog n); we 
leave mathematical analysis and sophisticated testing of empirical data as exercises for 
the ambitious reader. 

In most cases the time needed by a sort depends on the original sequence of the 
data. For some sorts, input data which is almost in sorted order can be completely sorted 
in time O(n), whereas input data that is in reverse order needs time that is O(n"). For 
other sorts the time required is O(n log.n) regardless of the original order of the data. 
Thus, if we have some knowledge about the original sequence of the data we can make 
a more intelligent decision about which sorting inethod to select. On the other hand, if 
we have no such knowledge we may wish to select a sort based on the worst possible 
case or based on the “average” case. In any event, the only general comment that can 
be made about sorting techniques is that there is no “best” general sorting technique. 
The choice of a sort must, of necessity, depend on the specific circumstances. 

Once a particular sorting technique has be > selected, the programmer should 
then proceed to make the program as efficient as possible. In many programming appli- 

„Cations it is often necessary to sacrifice efficiency for the sake of clarity. With sorting, 
the situation is usually the opposite. Once a sorting program has been written and tested, 
the programmer’s chief goal is to improve its speed, even if it becomes less readable. 
The reason for this is that a sort may account for the major part of a program’s effi- 
ciency, so that any irfiprovement in sorting time significantly affects-overall efficiency. 
Another-reason is that a sort is often used quite frequently, so that a small improvement 
in its execution speed saves a great deal of computer time. It is usually a good idea to 
remove function calls, especially from inner loops, and replace them with the code of 
the function in line, since the call-return mechanism of a language can be prohibitively 
expensive in terms of time. Also, a function call may involve the assignment of storage 
to local variables, an activity that sometimes requires a call to the operating system. In 
many of the programs we do not do this so as not to obfuscate the intent of the program 
with huge blocks of code... 

Space constraints are usually less important than time considerations. One rea- 
son for this is that, for most sorting programs, the amount of space needed is closer 
to O(n) than to O(n?). A second reason is that if more space is required it cah almost 
always be found in auxiliary storage. An ideal sort is an in-place sort whose additional 
space requirements are O(1). That is, an in-place sort manipulates the elements to be 
sorted within the array or list space that contained the original unsorted input. Addi- 
tional space that is required is in the form of a constant number of locations (such as 
declared individual program variables) regardless of the size of the set to be sorted. 

Usually, the expected relationship between time and space holds for sorting al- 
gorithms: those programs that require less time usually require more space, and vice 
versa. However, there are clever algorithms that utilize both minimum time and min- 
imum space; that is, they are O(n log n) in-place sorts. These may, however, require 
more programmer time to develop and verify. They also have higher constants of pro- 
portionality than many sorts that do use more space or that have higher time-orders and 

an requis more time to sort small sets. 
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In the remaining ‘sections we investigate some of the more popular sorting tech- 
niques and indicate some of their advantages and disadvantages. 


EXERCISES 


6.1.1. 


6.1.4. 
6.1.5. 


Choose any sorting technique with which you are familiar. 

(a) Write a program for the sort. 

(b) Is the sort stable? 

(c) Determine the time requirements of the sort as a function of the file size, both 
mathematically and empirically. 

(d) What is the order of the sort? 

(e) At what file size does the most dominant term begin to overshadow the others? __ 

Show that the function (log,, 2)‘ is O(n) for ail m and k but that n is not O((log n)*) for 

any k. 

Suppose that a time requirement is given by the formula a * n? + b * n*log n, where 

a and b are constants. Answer the following questions by both proving your results 

mathematically and writing a program to validate the results empirically. 

(a) For what values of n (expressed in terms of a and b) does the first term dominate 
the second? 

(b) For what value of n (expressed in terms of a and b) are the two terms equal? 


(c) For what values of n (expressed in terms of a and b) does the second term domi- 
nate the first? 


Show that any process that sorts a file can be extended to find all duplicates in the file. 
A sort decision tree is a binary tree that represents a sorting method based on com- 
parisons. Figure 6.1.5 illustrates such a decision tree for a file of three elements. Each 
nonleaf of such a tree represents a comparison between two elements. Each leaf repre- 
sents a completely sorted file. A left branch from a nonleaf indicates that the first key 
was smaller than the second; a right branch indicates that it was larger. (We assume 
that all the elements in the file have distinct keys.) For example, the tree of Figure 6.1.5 
represents a sort on three.elements x[0]. x[ 1]. x{2] that proceeds as follows: 

Compare x{0] with x{1]. If x{0] < a[l], compare .x{1] with x[2], and if x{1}] < x{2}. 
the sorted order of the file is x[O}. x{ 1}, x[2]; otherwise if x[0] < x{2], the sorted order is 

F 


Figure 6.1.5 Decision tree for a file of 
three elements. 
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MO}. x12). af |]. and if x10) > x12]. the sorted order is x[2}, x[0}, x[1]. If x{O] > xf1), 

proceed in a similar fashion down the right subtree. ; 

(a) Show that a sort decision tree that never makes a redundant comparison (that is, 
never compares x{/] and aj] if the relationship between xfi] and xfj] is known) 
has st! leaves. 

(b) Show that the depth of such a decision trec is at least log: (n!). 

(c) Show that n! = (n / 2)" 2, so that the depth of such a tree is O(n log n). 

(d) Explain why this proves that any sorting method that uses comparisons on a file 
of size n must make at least O(n log n) comparisons. 

6.1.6. Given a sort decision tree for a file as in the previous exercise, show that if the file 
contains some equal elements, the result of applying the tree to the file (where either a 
left or right branch is taken whenever two elements are equal) is a sorted file. 

6.1.7. Extend the concept of the binary decision tree of the previous exercises tu a ternary 
tree that includes the possibility of equality. It is desired to determine which elements 
of the file are equal, in addition to the order of the distinct elements of the file. How 
many comparisons are necessary? 

6.1.8. Show that if k is the smallest integer greater than or equal to n + log. n — 2, k com- 
parisons are necessary and sufficient to find the 'argest and second largest elements of 
a set of n distinct elements. 3 

6.1.9. How many comparisons are necessary to find the largest and smallest of a set of n 
distinct elements? 


6.1.10. Show that the function f(n) defined by 


fay =1 
f(n) = ftn—1)+ 1 nforn> | 


is O(log n). 


6.2 EXCHANGE SORTS 


Bubble Sort 


The first sort we present is probably the most widely known among beginning 
students of programming: the bubble sort. One of the characteristics of this sort is “’ 
it is easy to understand and program. Yet. of all the sorts we shall consider, it is pr 
the least efficient. 

In each of the subsequent examples. x is an array of integers of which the first n 
are to be sorted so that x[i] = xU) forO s i<j <n. it is straightforward to extend this 
simple format to one which is used in sorting 7 records, each with a subfield key k. 

The basic idea underlying the bubble sort is to pass through the file sequentially 
several times. Each pass consists of comparing each element in the file with its successor 
(x{i] with x[i + 11) and interchanging the two elements if they are not in proper order. 
Consider the following file: . 


25 57 48 37 12 92 86 33 
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The following comparisons are made on the first pass: 


x[0]. with x[1] (25 with 57) No interchange 
x{1} with x[2] (57 with 48) Interchange 
x(2} with x[3] (57 with 37) Interchange 
x[3] with x[4] (57 with 12) Interchange 
x[4), with x[5] (57 with 92) No interchange 
x{5} with x[6] (92 with 86) Interchange 
x{6] with x[7] (92 with 33) Interchange 


Thus, after the first pass, the file is in the order 
25 48 37 12 57 86 33 92 


Notice that after this first pass, the oe (in this case 92) is in its proper 
position within the array. In general, x[n — i] Will be in its proper position after iteration 
i. The method is called the bubble sort because each number slowly “bubbles” up to its 
proper position. After the second pass the file is 


i 25 37%% 12 48 57 33 86 92 


Notice that 86 has now found its way to the second highest position. Since each iteration 


places a new element into its proper position, a file of n elements requires no more than 
n — 1 iterations. 


The complete set of iterations is the following: 


lteration O (original file) 25 57 48 37 12 92 86 33 


Iteration | 25 48 37 12 57 86 33 92 
Iteration 2 25 37 12 48 57 33 86 92 
Iteration 3 25 12 37 48 33 57 86 92 
Iteration 4 12 25 37 33 48 57 86 92 
Iteration 5 12 25 33 37 48 57 86 92 
Iteration 6 12 25 33 37 48 57 86 92 
Iteration 7 12 25 33 37 48 57 86 92 


On the basis of the foregoing discussion we could proceed to code the bubble sort. 
However, there are some obvious improvements to the foregoing method. First, since 
all the elements in positions greater than or equal to n — i are already in proper position 
after iteration i, they need not be considered in succeeding iterations. Thus on the first 
pass n — 1 comparisons are made, on the second pass n — 2 comparisons, and on the 
(n — 1)th pass only one comparison is made (between x[0] and x[1]). Therefore the 
process speeds up as it proceeds through successive passes. 

We have shown that n — 1 passes are sufficient to sort a file of size n. However, 
in the preceding sample file of eight elements, the file was sorted after five iterations, 
making the last two iterations unnecessary. To eliminate unnecessary passes we must 
be able to detect the fact that the file is already sorted. But this is a simple task, since in 
a sorted file no interchanges are made on any pass. By keeping a record of whether or 
not any interchanges are made in a given pass it can be determined whether any further 
passes are necessary. Under this method, if the file can be sorted in fewer than n — | 
passes, the final pass makes no interchanges. 


Sorting Chap. 6 


. 340 


* \ 

Using these improvements, we present a routine bubble that accepts two variables 
xand n. xis an array of numbers, and n is an integer representing the number of elements 
to be sorted. (n may be less than the number of elements in E) 


void bubble(int x[], int n) 
{ 


int hold, j, pass; 
int switched = TRUE; 


for (pass = 0; pass < n-1 & switched == TRUE; pass++) { 
Je outer loop controls the number of passes sfs 
switched = FALSE; /* initially no interchanges have */ 
j /* been made on this pass, */ 
for (j = 0; j < n-pass-1; j++) I 


/* inner loop governs each individual pass EA 
if (x[j] > x[j+1]) { a 
Yh elements out of order ai 
Je an interchange is necessary */ 
switched = TRUE; 
hold = x[j]; 
x[j] = x[j+1]; i 7 


x[j+1] = hold; 
} /* end if */ 
} /* end for */ 
} /* end bubble */ 


What can be said about the efficiency of the bubble sort? In the case of a sort 
that does not include the two improvements outlined previously, the analysis is simple. 
There are n — | passes and n — 1 comparisons on each pass. Thus the total number 
of comparisons is (n — 1) *(n — 1) = n? — 2n + 1, which is O(n?). Of course, the 
number of interchanges depends on the original order of the file. However, the number 
of interchanges cannot be greater than the number of comparisons. It is likely that it 
is the number of interchanges rather than the number of comparisons that takes up the 
most time in the program’s execution. . 

Let us see how the improvements that we introduced.affect the speed of the bubble 
sort. The number of comparisons on iteration i is n — i. Thus, if there are k iterations the 
total number of comparisons is (n — 1) + (n — 2) + (n — 3)+--++(n—k), which equals 
(2kn — kK? — k)/2. It can be shown that the average number of iterations, k, is O(n), 
so that the entire formula is still O(n"), although the constant multiplier is smaller than 
before. However, there is additional overhead involved in testing and initializing the 
variable switched (once per pass) and setting it to TRUE (once for every interchange). 

The only redeeming features of the bubble sort are that it requires little additional 
space (one additional record to hold the temporary value for interchanging and several 
simple integer variables) and that it is O(n) in the case that the file is completely sorted 
(or almost completely sorted). This follows from the observation that only one pass of 


n — | comparisons (and no interchanges) is necessary to establish that a sorted file is 
sorted. i 
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There are some other ways to improve the bubble sort. One of these is to ybserve 
that the number of passes necessary to sort the file is the largest distance by which a 
number must move “down” in the array. In our example, for instance, 33, which starts 
at position 7 in the array, ultimately finds its way to position 2 after five iterations. The 
bubble sort can be speeded up by having successive passes go in opposite directions so 
that the small elements move quickly to the front of the file in the same way that the 


large ones move to the rear. This reduces the required number of passes. This version 
is left as an exercise. 


Quicksort 


The next sort we consider is the partition exchange sort (or quicksort). Let x be 
an array, and n the number of elements in the array to be sorted. Choose an element 
a from a specific position within the array (for example, a can be chosen as the first 
element so that a = x[0]). Suppose that the elements of x are partitioned so that a is 
placed into position j and the following conditions hold: 


1. Each of the elements in positions 0 through j — 1 is less than or equal to a. 
2. Each of the elements in positions j + 1 through n — | is greater than or equal 
toa. . 


Notice that if these two conditions hold for a particular a and j, a is the jth smallest 
element of x; so that a remains in position j when the array is completely sorted. (You 
are asked to prove this fact as an exercise.) If the foregoing process is repeated with 
the subarrays x[0] through x[j — 1] and x[j + 1] through x[n — 1] and any subarrays 
created by the process in successive iterations, the final result is a sorted file. 

Let us illustrate the quicksort with an example: If an initial array is given as 


25 57 48 37 12 92 86 33 
and the first element (25) is placed in its proper position, the resulting array is 
12 25 57 48 37 92 86 33 


At this point, 25 is in its proper position in the array (x[1]), each element below 
that position (12) is less than or equal to 25, and each element above that position (57, 
48, 37, 92, 86, and 33) is greater than or equal to 25. Since 25 is in its final position the 
original problem has been decomposed into the problem of sorting the two subarrays 


(12) and (57 48 37 92 86 33) 


Nothing need be done to sort the first of these subarrays; a file of one element is 
already sorted. To sort the second subarray the process is repeated and the subarray is 
further subdivided. The entire array may now be viewed as 


12 25 (57 48 37 92 86 33) 


where parentheses enclose the subarrays that are yet to be sorted. Repeating the process 
on the subarray x[2] through x[7] yields 


12 25 (48 37 33) 57 (92 86) 
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and further repetitions yield 


12 25 (37 33) 48 57 (92 86) 
12 25 (33) 37 48 57 (92 86) 
12 25 33 37 48 57 (92 86) 
12 25 33 37 48 57 (86) 92 
12°25 33 37 48 57 86 92 


Note that the final array is sorted. 


By this time you should have noticed that the quicksort may be defined most 
conveniently as a recursive procedure. We may outline an algorithm quick(/b,ub) to 


sort all elements in an array x between positions lb and ub (lb is the lower boun:!, ub 
the upper bound) as follows: 


` if (Ib >= ub) 
return; ` j2 array is sorted #/ 


partition(x, 1b,ub, j); /* partition the elements of the */ 
/* subarray such that one of the */ 
/* elements (possibly x[1b]) is / 
/* now at x[j] (j is an output ay, 
Vt parameter) and: */ 
/* 1. x{i] < xij) for Ib &ei<j */ 
/* 2. x[i] >= x{j] for j < i <= ub */ 
/* x[j] is now at its final */ 
1t position */ 


` quick(x,1b,j - 1); /* recursively sort the subarray */ 
/* between positions 1b and j-1 */ 


quick(x,j + 1,ub); /* recursively sort the subarray */ 
; /* between positions j+landub */ 


There are now two problems. We must produce a mechanism to implement partition 
and produce a method to implement the entire process nonrecursively. 

The object of partition is to allow a specific element to find its proper position 
with respect to the others in the subarray. Note that the manner in which this partition 
is performed is irrelevant to the sorting method. All that is required by the sort is that 
the elements be partitioned properly. In the preceding example, the elements in each 
of the two subfiles remain in the same relative order as they appear in the original file. 
However, such a partition method is relatively inefficient to implement. 

One way to effect a partition efficiently is the following: Leta = x{/b] be the 
element whose final position is sought. (There is no appreciable efficiency gained by 
selecting the first element of the subarray as the one which is inserted into its proper 
position; it merely makes some of the pregrams easier to code.). Two pointers, up and 
down, are initialized to the upper and lower bounds of the subarray, respectively. At any 

| 
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point during execution, eachelement in a position above up is greater than or equal to 
a, and each element in a position below down is less than or equal to a. The two pointers 
up and down are moved towards each other in the following fashion. 


Step 1: Repeatedly increase the pointer down by one position until x[down] > a. 
Step 2: Repeatedly decrease the pointer up by one position until x[up] <= a. 
Step 3: If up > down, interchange x[down] with x[up]. 


The process is repeated until the condition in step 3 fails (up <= down), at which point 
x{up] is interchanged with x[/b] (which equals a), whose final position was sought, and 
jis set to up. : 

We illustrate this process on the sample file, showing the positions of up and down 
as they are adjusted. The direction of the scan is indicated by an arrow at the pointer 
being moved. Three dsterisks on a line indicates that an interchange is being made. 


a = x[Tb] = 25 
down--> up 
°25 57 48 37 12 92 86 33 
down up 
25 57 48 37 12 92 B86 33 
down <--up 
25 57 48 37 12 92 86 33 
down . (<-sup 
25 57 48 37 2 92 86 33 
down <--up ; 
25 57 48 37 12 92 86 33 
down up 
25 57 48 37 12 92 86 33 
down up 
r 2 48 37 57 92 86 33 wan 
down--> up 


25 122 48 37 57 92 86 33 

25 12 48 y. 9 92 . 86 33 
(down ep 

25 2 48 37 ‘$7 92 86 33 


25 2 48 37 57 92 86 33 
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: ` <--up, down 
25 12 48 37 57 92 86 33 


up down 
25 12 48 37 57 92 86 33 


up down ‘ 
12 25 48 37 57 92 86 33 aee 


Atthis point 25 is in its proper position (position 1), and every element to its left is less 

than or equal to 25, and every element to its right is greater than or equal to 25. We 

could now proceed to sort the two subarrays (12) and (48 37 57 92 86 33) by applying 
` the same method. 


This particular algorithm can be implemented by the following procedure. 
void partition (int x[], int 1b, int ub, int *pj) 
{ 


int a, down, temp, up; 


a = x[(1b]; /* a is'the element whose final */ 
MED i position is sought t7 

up = ub; "ea : ` 

down = Tb; 


while (down < up) { 
while (x[down] <='a && down < ub) 
down++; J: . move up the array */. 
while (x[up] > a) ty 
up--; i move down the array */ 
if (down < up) { 
/* interchange xfdown) and x[up] */ 
temp = x[down]; 
x{down] = x[up]; 
x[Up} -=~temp; 
} /* end if */ f 
} /* end while */ i e 
x[lb] = x[up]; ; 
` x[up] = a; 
*pj = up; 
} /* end partition */ 


Note that if kequals ub — Ib + 1, so that we are rearranging a subarray of size k, 
the routine uses k key comparisons (of x[down] with a and x[up] with a) to perform the 
Partition. . 

The routine can be made slightly more efficient by eliminating some of the re- 
dundant tests. You are asked to do this as an exercise. 

Although the recursive quicksort algorithm is relatively clear in terms of what it 
accomplishes and how, it is desirable to avoid the overhead of routine calls in programs 
such as sorts in which execution efficiency is a significant consideration. The recursive 
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calls to quick can easily be eliminated by using!a stack as in Section 3.4, Once partition 
has been executed, the current parameters to quick are no longer needed, except in 
computing the arguments to the two subsequent recursive calls. Thus instead of stacking 
the current parameters upon each recuisive call, we can compute and stack the new 
parameters ‘or each of the two recursive calls, Under this approach, the stack at any 
point contains the lower and upper bounds of all subarrays that must yet be sorted. 
Furthermore, since the second recursive call immediately precedes the return to the 
calling program (as in the Towers of Hanoi problem), it may be eliminated entirely and 
replaced with a branch. Finally, since the order in which the two recursive calls are 
made does not affect the correctness of the algorithm, we elect iñ each case to stack the 
larger subarray and process the smaller subarray immediately. As we explain shortly, 
this technique keeps the size of the stack to a miaimum. 

We may now code a function to implement the quicksort. As in the case of bubble, 
the parameters are the array x and the number of elements of x that we wish to sort, n. 
The routine push pushes lb and ub onto the stack, popsub pops them from the stack, 
and empty determines if the stack is empty. 

#define MAXSTACK ... /* maximum stack size */ 

void quicksort(intxf], int n) 


int i, j; 
Struct bndtype { 
int 1b; 
int ub; 
} newbnds; ~. _ 
/* stack is used ‘by-the pop, push and empty functions */ 
struct { 
int top; 
struct bndtype bounds (MAXSTACK] ; 
} stack; 


stack.top = -1; 
newbnds.1b = 0 
newbnds.ub = n-1; 
push(&stack, &newbnds); 
/* repeat as long as there are any */ 
/* unsorted subarrays on the stack */ 
while (!empty(&stack)) { . 
popsub(&stack, &newbnds); 
while (newbnds.ub > newbnds.1b) { 
/* process next subarray */ 
partition(x, newbnds.1b, newbnds.ub, &j); 
/* stack the larger subarray */ 
if. (j-newbnds.1b > newbnds.ub-j) { 
/* stack lower subarray */ 
i = newbnds.ub; 
newbnds.ub = j-1; 
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push(&stack, &newbnds); 

/* process upper subarray */ 
newbnds.1b = j+1; 

newbnds.ub = i; 


else { 
/* stack upper subarray */ 
i = newbnds.1b; 
newbnds.1b = j+1; 
push(&stack, &newbnds) ; 
/* process lower subarray */ 
newbnds.1b = i; 6 
newbnds.ub = j-1; 
} /* end if */ 
} /* end while */ 
} /* end while */ 
} /* end quicksort */ 


The routines partition, empty, popsub, and push should be inserted in line for 
maximum efficiency. Trace the action of quicksort on the sample file. 

Note that we have chosen to use x{/b] as the element around which to partition 
each subfile because of programming convenience in the proc.‘dure partition, but any 
other element could have been chosen as well. The element around which a file is par- 
titioned is called a pivot. It is not even necessary that the pivot be an element of the 
subfile; partition can be written with the header 


partition(1b, ub, x, j, pivot) 


to partition x[/b] through x[ub] so that all elements between x{/b] and x[j — 1] are 
less than pivot and all elements between x[j) and x[ub] are greater than or equal to 
pivot. In that case the element x[j] is itself included in the second subfile (since it 
is not necessarily in its proper position), so that the second recursive call to quick is 
quick(x, j, ub) rather than quick(x, j + 1, ub). 

Several choices for the pivot value have been found to improve the efficiency of 
quicksort by guaranteeing more nearly balanced subfiles. The first technique uses the 
median of the first, last, and middle elements of the subfile to be sorted (that is. the 
median of x[/b]. x[ub], and x{(/b + ub)/2]) as the pivot value. This median-of-three 
value is closer to the median of the subfile being partitioned than x[/b}, so that the two 
partitions of the subfile are more nearly equal in size. In this method the pivot value is an 
element of the file, so that quick (x, j + 1, ub) can be used as the second recursive call. 

A second method, called meansort, utilizes x{/b] or the median-of-three as pivot 
when partitioning the original file but adds code in partition to compute the means 
(averages) of the two subfiles being created. In subsequent partitions the mean of each 
subfife, calculated when the subfile was created, is used as a pivot value. Again, this 
mean is closer to the median of the subfile than x{/b} and results in more nearly balanced 
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files. The mean is-not necessarily an element of the file, so that quick (x, j, ub) must 
be used as the second recursive Call. The cade to find the méan does not require any 
additional key comparisons but does add some extra overhead, 

Another technique, called Bsort, uses the middle element of a subfile as the pivot. 
During partition, whenever the pointer up is decieased, x[up] is interchanged with 
x{up + 1) if x{up] > x[up + 1]. Whenever the pointer down is increased, x[down] 
is interchanged with x[down — 1) if x[down] < x[down — 1]. Whenever x[up] and 
x[down] are interchanged x{ up] is interchanged with x[up + 1] if x{up) > {yp + 1), 
and x[down] is interchanged with x|down — 1) if x[down] < x[down — 1). This guaran- 
tees that x[up] is always the smallest element in the right subfile (from xfup] to x[ub}) 
and that x[down] is always the largest element in the left subfile (from x[lb] to x[down)). 

This allows two optimizations: If’no interchanges between x[up] and xlup + J) 
were required during the partition, the right subfile is known to be sorted and need not 
be stacked, and if no interchanges between.x{down] and x[down — 1] were required, the 
left subfile is known to be sorted and need not be stacked. This is similar to the technique 
of keeping a flag in bubblesort that detects that no interchanges have taken place during 
an entire pass so that’no additional passes are necessary. Second, a subfile of size 2 is 
known to be sorted and need not be stacked. A subfile of size 3 can be directly sorted 
with just a single comparison and possible interchange (between the first two elements 
in a left subfile and between the last two in a right subfile). Both optimizations in Bsort 
reduce the number of subfiles that must be processed. 


Efficiency of Quicksort 


How efficient is the quicksort? Assume that the file size n is a power of 2, say 
n = 2", so that m= logy n. Assume also that the Proper position for the pivot always 
turns out to be the exact middle of the subarray. In that case there will be approximately 
n comparisons (actually n — 1) on the first pass, after which the file is split into two 
subfiles each of size n/2, approximately. For each of these two files there are approx- 
imately n/2 comparisons, and a total of four files each of size n/4 are formed. Each of 
these files requires 7/4 comparisons yielding a total of n/8 subfiles. After halving the 
subfiles m times, there are n files of size 1. Thus the total number of comparisons for 
the entire sort is approximately a. a 


n+2*(n/2)+4*(n/4)-+8*(n/8)4+--- +n (n/n) 
or 


n+n+n+n+::: + n(m terms) 


- comparisons. There are m terms because the file is subdivided m times. Thus the total 
number of comparisons is O(n * m) or O(n log n) (recall that m = log> n). Thus if the 
foregoing properties describe the file, the quicksort is O(n log n), which is relatively 
efficient. 

For the unmodified quicksort in which x{/b] is used as the pivot value, this anal- 
ysis assumes that the original array and all the resulting subarrays are unsorted, so 
that the pivot value x{/b] always finds its Proper position at the middle of the subarray. 
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Suppose that the preceding conditions do not hold and the original array is sorted (or 
almost sorted). If, for example, x{/b] is in its correct position, the original file is split 
into subfiles of sizes 0 and n — 1. if this process continues, a total of n — 1 subfiles are 
sorted, the first of size n, the second of size n — 1, the third of size n — 2, and so on. 


Assuming k comparisons to rearrange a file of size k, the total number of comparisons 
to sort the entire file is 


n+(n—1)+(n-—2)+---+2 


which is O(n). Similarly, if the original file is sorted in descending order the final posi- 
tion of x[/b] is ub and the file is again split into two subfiles that are heavily unbalanced 
(sizes n — 1 and 0). Thus the unmodified quicksort has the seemingly absurd property 
that it works best for files that are “completely unsorted” and worst for files that are 
completely sorted. The situation is precisely the opposite for the bubble sort, which 
works best for sorted files and worst for unsorted files. . 

It is possible to speed up quicksort for sorted files by choosing a random element, 
of each subfile as the pivot value. If a file is known to be nearly sorted, this might be 
a good strategy (although, in that case choosing the middle element as a pivot would 
be even better). However, if nothing is known about the file, such a Strategy does not 
improve the worst case behavior, since it is possible (although improbable) that the 
random element chosen each time might consistently be the smallest element of each 
subfile. As a practical matter, sorted files are more common than a good random number 
generator happening to choose the smallest element repeatedly. 

The analysis for the case in which the file size is not an integral power of 2 is 
similar but slightly more complex: the results, however, remain the same. It can be 
shown, however, that on the average (over all files of size n), the quicksort makes ap- 
proximately 1.3867 log, n comparisons even in its unmodified version. In practical 
situations, quicksort is often the fastest available because of its low overhead and its 
average O(n log n) behavior. l 

If the median-of-three technique is used, quicksort can be O(n log 'n) even if the 
file is sorted (assuming that partition leaves the subfiles sorted). However, there are 
pathclogical files in which the first, last, and middle elements of each subfile are al- 
ways the three smallest or largest elements. In such cases, quicksort remains O(n*). 
Fortunately, these are rare. 

Meansort is O(n log n) as long as the elements of the file are uniformly distributed 
between the largest and smallest. Again, some rare distributions may make it O(n), but 
this is less likely than the worst case of the other methods. Forrandom files, meansort does . 
not offer any significant reductions in comparisons or interchanges over standard quick- 
sort. Its significant overhead for computing the mean requires far more CPU time than 
standard quicksort. For a file known to be almost sorted, meansort does provide signif- 
icant reduction in comparisons and interchanges. However, the mean-finding overhead 
makes it slower than quicksort unless the file is very close to being completely sorted. 

Bsort requires far less time than quicksort or meansort on sorted or nearly sorted 
` input, although it does require more comparisons and interchanges than meansort 

for nearly sorted input (but meansort has Significant overhead in finding the mean). It 
requires fewer comparisons but more interchanges than meansort and more of both than 
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quicksort for randomly sorted input. However, its CPU requirements are far lower than 
meansort’s, although somewhat greater than quicksort for random input. 

Thus Bsort can be recommended if the input is known to be nearly sorted or if we 
are willing to forgo moderate increases in average sorting time to avoid very large in- 
creases in worst-case sorting time. Meansort can be recommended only for input known 
to be very nearly sorted and standard quicksort for input likely to be random or if aver- 
age sorting time must be as fast as possible. In Section 6.5, we present a technique that 
is faster than either Bsort or meansort on nearly sorted files. 

The space requirements for the quicksort depend on the number of nested recur- 
sive calls or on the size of the stack. Clearly, the stack can never grow larger than the 
number of elements in the original file. How much smaller than n the stack grows de- 
pends on the number of subfiles generated and on their sizes. The size of the stack is 
somewhat contained by always stacking the larger of the two subarrays and applying 
the routine to the smaller of the two. This guarantees that all smaller subarrays are sub- 
divided before larger subarrays, giving the net effect of having fewer elements on the 
stack at any given time. The reason for this is that a smaller subarray will be divided 
fewer times than a larger subarray. Of course, the larger subarray will ultimately be 
processed and subdivided, but this will occur after the smaller subarrays have already 
been sorted and therefore removed from the stack. 

Another advantage of quicksort is locality of reference. That is, over a short period 
of time all array accesses are to one or two relatively small portions of the array (a | 
subfile or portion thereof). This insures efficiency in the virtual memory environment, 
where pages of data are constantly being swapped back and forth between external 
and internal storage. Locality of reference results in fewer page swaps being required 
for a particular program. A simulation study has shown that in such an environment, 
quicksort uses less space-time resources than any other sort considered. 


EXERCISES 


6.2.1. Prove that the number of passes necessary in the bubble sort of the text before the file is 
in sorted order (not including the last pass, which detects the fact that the file is sorted) 
equals the Jargest distance by which an element must move from a larger index to a 
smaller index. 

6.2.2. Rewrite the routine bubble so that successive passes go in opposite directions. 


6.2.3. Prove that. in the sort of the previous exercise, if two elements are not interchanged 
during two consecutive passes in opposite directions, they are in their final position. 

6.2.4. A sort by counting is performed as follows. Declare an array count and set count{i] to 
the number of elements that are less than x[i]. Then place x[i] in position count{i] of an. 
output array. (However. beware of the possibility of equal elements.) Write a routine 
to sort an array x of size n using this method. 

6.2.5. Assume that a file contains integers between a and b, with, many numbers repeated 
several times. A distribution sort proceeds as follows. Declare an array number of size 
b - a + 1, and set number|i — a) to the number of times that integer i appears in the 
file. and then reset the values in the file accordingly. Write a routine to sort an array + 


of size n containing integers between a and b by this method. 
= 
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6.2.6. The odd-even transposition sort proceeds as follows. Pass through the file several 
times. On the first pass. compare x{i] with x[i + 1] for all odd i. On the second pass, 
compare x[i] with x[i + 1] for all even i. Each time that xfi) > xli + 1], interchange 
the two. Cont! iue alternating in this fashion until the file is sorted. 

(a) What is the condition for the termination of the sort? 
(b) Write aC routine to implement the sort. 
t (c) On the average what is the efficiency’of this sort? 


6.2.7. Rewrite the program for the quicksort by starting with the recursive algorithm and 
applying the methods of Chapter 3 to produce a nonrecursive version. 

6.2.8. Modify the quicksort program of the text so that if a subarray is small. the bubble sort 
is used, Determine, by actual computer runs, how small the subarray should be so that 
this mixed strategy will be more efficient that an ordinary quicksort. 

6.2.9. Modify partition so that the middle value of x{/b], x{ub}, and x[ind} (where ind = 
(ub + Ib)/2) is used to partition the array. In what cases is the quicksort using this 
method more efficient than the version of the text? In what cases is it less efficient? 

6.2.10. Implement the meansort technique. partition should use the mean of the subfile be- 
ing partitioned, computed when the subfile was created, as the pivot value and should 
compute the mean of each of the two subfiles that it creates. When the upper and lower 
bounds of a subfile are stacked, its mean should be stacked as well. 

6.2.11. Implement the Bsort technique. The middle element of each file should be used as the 
pivot, the last element of the left subfile being created should be maintained as the 
largest in the left subfile. and the first element of the right subfile should be maintained 
as the smallest in the right subfile. Two bits should be used to keep track of whether the 
two subfiles are sorted at the end of the partition. A sorted subfile need not be processed 
further. If a subfile has three or fewer elements, sort it directly by a single interchange, 
at most. i 

6.2.12. (a) Rewrite the routines for the bubble sort and the quicksort as presented in the text 

and the sorts of the exercises so that a record is kept of the actual number of 
comparisons and the actual number of interchanges made. 

. (b) Write a random-number generator (or use an existing one if your installation has 
one) that generates integers between 0 and 999. 

(c) Using the generator of part (b). generate several files of size 10. size 100, and size 
1000. Apply the sorting routines of part (a) to measure the time requirements for 
each of the sorts on each of the files. ` 

(d) Measure the results of part (c) against the theoretical values presented in this 
section. Do they agree? If not, explain. In particular, rearrarige the files so that 
they are completely sorted and in reverse order and see how the sorts behave with 
these inputs. 


6.3 SELECTION AND TREE SORTING 


A selection sort is one in which successive elements are selected in order and placed 
into their proper sorted positions. The elements of the input may have to be preprocessed 
to make the ordered selection possible. Any selection sort can be conceptualized as the 
following general algorithm that uses a descending priority queue (recall that pginsert 
inserts into a priority queue and pgmaxdelete retrieves the largest element of a priority 
queue). 
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Set dpq to the empty descending priority queue; 
/* preprocess the elements of the input array */ 
/* by inserting them into the priority queue */ 
for (i = 0; i< my i+) 

pqinsert(dpq, x[i]); 
/* select each successive element in order = */ 
for (i = n-1; i >= 0; j--) 

x[i} = pgmaxdelete(dpq); 


This algorithm is called the general selection sort. 

We now examine several different selection sorts. Two features distinguish a spe- 
cific selection sort. One feature is the data structure used to implement the priority 
queue. The second feature is the method used to implement the general algorithm. A 
Particular data structure may allow significant optimization of the general selection sort 
algorithm. A 

Note also that the general algorithm can be modified to use an ascending priority 


queue apq rather than dpq. The second loop that implements the selection phase would 
be modified to 


for (i = 0; i< n; is) 
x[i] = pqmindelete(apq); 


Straight Selection Sort _ 


The straight selection sort, or push-down sort, implements the descending pri- 
ority queue as an unordered array, The input array x is used to hold the priority queue, 
thus eliminating the need for additional ‘space. The straight selection sort is, therefore, 
an in-place sort. Moreover, because the input array x is itself the unordered array that 
will represent the descending priority, the input is already in appropriate format and the 
preprocessing phase is unnecessary. 

Therefore the straight selection sort consists entirely of a selection phase in which 
the largest of the remaining elements, large, is repeatedly placed in its proper position, 
i, at the end of the array. To do so, large is interchanged with the element x[i]. The initial 
n-element priority queue is reduced by one element after each selection. After n — 1 
selections the entire array is sorted. Thus the selection process need be done only from 


n — 1 down to 1 rather than.down to 0. The following C function implements Straight 
selection: i 


void selectsort(int x[], int n) 
int i, indx, j, large; 


for (i = n-1; i > 0; i--) { 
/* place the’ largest number of x[0]. through */ 
/* x[i] into large and its index into indx */ 
large = x[0]; i 
indx = 0; 
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for (j = 1; j <= i; j++) 
if (x[j] > large) { 
large = x[j]; 
indx = j; 
} /* end for... if */ 
x{indx] = x[i]; 
x[i] = large; 
} /* end for */ 
} /* end selectsort */ 


Analysis of the straight selection sort is straightforward. The first pass makes 


n — | comparisons, the second pass makes n — 2, and so on. Therefore, there is a total 
of 


(n= 1) + (n- 2) 4+ (n-3) 4-0-4. = n¥*(n—1)/2 


comparisons, which is O(n). The number of interchanges is always n — | (unless a test 
is added to prevent the interchanging of an elernent with itself). There is little additional 
storage required (except to hold a few temporary variables). The sort may therefore be 
categorized as O(n"), although it is faster than the bubble sort. There is no improvement 
if the input file is completely sorted or unsorted, since the testing proceeds to corapletion 
without regard to the makeup of the file. Despite the fact that it is simple to code, it is 
unlikely that the straight selection sort would be used on any files but those for which 
nis small. 

It is also possible to implement a sort by representing the descending priority 
queue by an ordered array. Interestingly, this leads to a sort consisting of a preprocess- 
ing phase that forms a sorted array of n elements. The selection phase is, therefore, 
superfluous. This sort is presented in Section 6.4 as the simple insertion sort; it is not 
a selection sort, since no selection is required. . ` 


Binary Tree Sorts 


In the remainder of this section we illustrate several selection sorts that represent 
a priority queue by a binary tree. The first method is the binary tree sort of Section 
5.1, which uses a binary search tree. The reader is advised to review that sort before 
proceeding. 

The method involves scanning each element of the input file and piacing it into 
its proper position in a binary tree. To find the proper position of an element; y, a left or 
right branch is taken at each.node, depending on whether y is less than the eiement in 
the node or greater than or equal to it. Once each input element is in its proper position 
in the tree, the sorted file can be retrieved by an inorder traversal of the tree. We present 
the algorithm for this sort, modifying it to accommodate the input as a preexisting array. 
- Translating the algorithm to a C routine is straightforward. 


/* establish the first element as root gi 
tree = maketree(x[0]); 
/* repeat for each successive element */ 
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for (i = 1; i< n; i++) { 
y = xli]; 
q = tree; 
P=q; 
/* travel down the tree until a leaf is reached */ 
while (p != null) { 
9 = p; 
if (y < info(p)) 
p = left(p); 
else 
p = right(p); 
} /* end while +/ 
if (y < info(q)) l : 
setleft(q,y); 
else 
setright(q,y); 
} /* end for */ 
/* the tree is built, traverse it in inorder */ 
intrav(tree); 


To convert the algorithm into a routine to sort an array, it is necessary to revise 
intrav so that visiting a node involves placing the contents of the node into the next 
Position of the original array, 

Actually, the binary search tree represents an ascending priority queue, as de- 
scribed in Exercises 5.1.13 and 5.2.13. Constructing the tree represents the Preprocess- 
ing phase, and the traversal represents the selection phase of the general selection sort 
algorithm. 

Ordinarily, extracting the minimum element (pqmindelete) of a priority queue 
represented by a binary search tree involves traveling down the left side of the tree 
from the root. Indeed. that is the first Step of the inorder traversal process. However. 
Since no new clements are inserted into the tree once the tree is constructed and the 
minimum element does not actually have to be deleted, the inorder traversal efficiently 
implements the successive selection process, 5 

The relative efficiency of this.method depends on the original order of the data. 
If the original array is completely sorted (or sorted in reverse order), the resulting tree 
appears as a sequence of only right (or left) links, as in Figure 6.3.1. In this case the 
insertion of the first node requires no comparisons, the second node requires two com- 
parisons, the third node three comparisons, and so on. Thus the total number of com- 
parisons is 

24+346-4+¢n = ne(n+1)2-4 


which is O0?). 

On the other hand, if the data in the original array is organized so that approxi- 
mately half the numbers following any given number « in the array areJess than a and 
half are greater than a, balanced trees such as those in Figure 6.3.2 result. In such a 
case the depth of the resulting binary tree is the smallest integer greater than or equal 
to log: (n + 1) — 1. The number of nodes at any level / (except possibly for the last) is 
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Origina! data: 


4 8 12°. T7 726 


Number of comparisons: 14 Number of comparisons: 14 


fa) (b) 


Figure 6.3.1 
. s i 
?' and the number of comparisons necessary to place a node at level / (except when 
? = 0)is/ + 1. Thus the total number of comparisons is between 


d- d 
d+® 2! «(1+ land X 2 * (+1) 
l=} f=} 
Original data: Original data: 
fo "Se y i. 26 17. 8 po. te. 


es) © ra) (2) g 


Number of comparisons: 10 Number of comparisons: 19 


fa) ` {b) 


Figure 6.3.2 
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It can be shown (mathematically inclined readers might be interésted in proving this 
fact as an exercise) that the resulting sums are O(n log n). 

Fortunately, it can be shown that if every possible ordering of the input is consid- 
ered equally likely, balanced trees result more often than not. The average sorting time 
for a binary tree sort is therefore O(n log n), although the constant of proportionality is 
larger on the average than in the best case. However, in the worst case (sorted input), the 
binary tree sort is O(n”). Of course, once the tree has been created, time is expended in 
traversing it. If the tree is threaded as it is created, the traversal time is reduced and the 
need for a stack (implicit i in the recursion or explicit in a nonrecursive inorder traversal) 
- is eliminated. 

This sort requires jie one tree node be reserved for each array element. Depend- ; 
ing on the method used to 'implement the tree, space may be required for tree pointers 
and threads, if any. This additional space requirement, together with the poor O(n?) 


time efficiency for sorted or reverse-order input, represents the primary drawback of 
the binary treg sort. 


* Heapsort 


The drawbacks of the binary tree sort are remedied by the heapsort, an in-place 
sort that requires only O(n'log n) operations regardless of the order of the i input. Define 
a descending heap (also called a max heap or a descending partially ordered tree) of 
size n as an almost complete binary tree of n nodes such that the content of each node 
is less than or equal to the content of its father. If the sequential representation of an 
almost complete binary tree is used, this condition reduces to the inequality 


infol j] = infol = 1)/2} for O< (j-1)/2)<jsn-t 


It is clear from this definition of a descending heap that the root of the tree (or the 
first element of the array) contains the largest element in the heap. Also note that any 
path from the root to a leaf (or indeed, any path in the tree that includes no more than 
one node at any level) is an ordered list in descending order. It is also possible to define 
an ascending heap (or a min heap) as an almost complete binary tree such that the 
content of each node is greater than or equal to the content of its father. In an ascending 
heap, the root contains the smallest element of the heap, and any path from the root to 
a leaf is an ascending ordered list.: 

A heap allows.a very efficient implementation of a priority queue. Recall from 
Section 4.2 that an ordered list containing n elements allows priority queye insertion 
(pqinsert) to be implemented using an average of approximately n/2 node accesses, 
and deletion of the minimum or maximum (pqmindelete or pqmaxdelete) using only 
one node access. Thus a sequence of.n insertions and n deletions from an ordered list 
such as is required by a selection sort could require O(n) operations. Although priority 
queue insertion using a binary search tree could require only as few as logs n node 
accesses. it could require as many as n node accesses if the tree is unbalanced. Thus a 
selection sort using a binary search tree could also require O(n?) operations, although 
on the average only O(n log n) are needed. : 


Sorting Chap. 6 


356 


As we shall see, a heap allows both insertion and deletion to be implemented 
in. O(log n) operations. Thus a selection sort Consisting of n insertions and n deletions 
can be implemented using a heap in O(n log n) operations, even in the worst case. An 
additional bonus is that the heap itself can be implemented within the i input array x using 
the sequential implementation of an almost complete binary ‘tree. The only ae 


space required is for program variables: The heapscit is, therefore, an O(n log n) in- 
place sort. 


Heap as a Priority Queue 7 
Let us how implement a descending priority queue using a descending heap. Sup- 
pose that dpq is an array that implicitly represents a descending heap of size k. Because 
the priority queue is contained in array elenients 0 to k — 1, we add k as a parameter of 
the insertion and deletion operations. Then the operation pqinsert(dpq. k, elt) can be im- 
plemented by simply inserting elz into its proper position in the descending list formed 
by the path from the root of the heap (dpq{0}) to the leaf dpq{k]. Once pginsert(dpg, k. 
elt) has been executed, dpq becomes a heap of size k + 1. k 
The insertion is done by traversing the path from the empty position k to position 
0 (the root), seeking the first element greater than or equal to elt, When that element 
is found, elt is inserted immediately preceding it in the path (that i is, elt is inserted as 
its son). As each ele.nent less than el? is passed during the traversal, it is shifted down 
one level in the tree to make room for éir. (This shifting i is necessary because we are 
using the sequential representation rather than a linked representation of the tree, A 
new element cannot be inserted between two existing elements without shifting some 
existing elements.) - 
This heap insertion operation is also called the siftup opini because elt sifts 
its way up the tree. The following.algorithm implements pqinsert (dpq, k, elt): 
aie re 
f= {s-1)/2; /* fis the father of s */ 
while (s > 0 && dpqif] < elt) -{ 
dpq{s} = dpal f); 
s = f; /* advance up the tree */ 
f = (s - 1/2; 
} /* end while */ 
dpq[s] = elt; 


Insertion is clearly O(log n), since an almost complete binary tree with n nodes 
has log: n +1 levels, and at most, one node per level is accessed. 

We now examine how to implement pqmaxdelete (dpq,k) for a descending heap 
of size k. First we define subtree (pm), where m is greater than p. as-the subtree 
(of the descending heap) rooted at position p within the elements dpg[p] through 
dpq\m). For example. subrree(3.10) consists of the root dpq{3} and its two chil- 
dren dpq{[7} and dpq[8}. subtree(3;17) consists of dpg[3)}. dpg{7), daq|8)2 dpg{ 15}. 
dpal 16}. and dpg{17]. If dpgli] is included in subtree(pan), dpal2 «i 1) is in- 
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cluded if and only if 2*i +1 <= m, and dpg{2*i + 2) is included if and only if 
2*i+2<= m. If mis less than p, subtree(p,m) is defined as the empty tree, 

To implement pqmaxdelete(dpq,k), we note that the maximum element is always 
at the root of a k-element descending heap. When that element is deleted, the remain- 
ing k — 1 elements in Positions 1 throygh k — 1 must be redistributed into positions 
O through k — 2 so that the resulting array segment from dpq{0) through dpg[k — 2] 
remains a descending heap. Let adjustheap(root,k) be the operation of rearranging the 
elements dpq[root + 1] through dpq{k] into dpg{ root] through dpq{k — 1] so that sub- 
tree(root,k — 1) forms a descending heap. Then pqmaxdelete(dpq,k) for a k-element 
descending heap can be implemented by 


p = dpa[0); 
adjustheap(0,k - 1); 
return(p); 


In a descending heap, not only is the root element the largest element in the 
tree, but an element in any position p must be the largest element in subtree(p,k). 
Now, subtree(p,k) consists of three groups of elements: its root, dpg{p]; its left sub- 
tree, subtree(2 * p + 1, k); and its right subtree, subtree(2 * p + 2, k). dpq{2 * p + 1}, 
the left son of the root, is the largest element of the left subtree, and dpq{2 * p + 2), the 
right son of the root, is the largest element of the right subtree. When the root dpg{p] is 
deleted, the larger of these two sons must move up to take its place as the new largest 
element of subrree(p,k). Then the subtree rooted at the position of the larger element 
moved up must be readjusted in turn. 

Let us define /argeson(p, m) as the larger son of dpg{[p] within subtree(p, m). It 
may be implemented as 


S=2* p+]; 
if (s + 1 <= m & x[s] < x[s + 1)) 
S=s+l; 
/* check if out of bounds */ å i 
if (s > m) 
return(-1); 
else 
return(s); 


Then adjustheap(root, k) may be implemented recursively by 


f = root; 

s =.largeson(f, k - 1): 

if (S >= 0 && dpg{k] < dpq{s]}) { 

- dpqlf] = dpals]; . 
adjustheap(s, k); 


else 


dpi f) = dpg[k); 
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The following is an iterative version of adjustheap. The algorithm uses a tempo- 
rary variable kvalue to hold the value of dpalk): 


f = root; 

kvalue = dpq[k]; 

s= largeson(f, k - 1); 

while (s >= 0 & kvalue < dpg{s}) { 
dpa[f) = dpals); 
fes; 
s = largeson(f, k - 1); 


} 
dog[f] = kvalue; ` 


Note that we traverse a path of the tree from the root toward a leaf, shifting up 
by one position all elements in the path greater than dpq{k| and inserting dpg|k| in its 
Proper position in the path. Again, the shifting is necessary because we are using the 
sequential representation rather than a linked implementation of the tree. The adjust- 
ment procedure is often called the sifidown operation because dpq{k] sifts its way from 
the root down the tree. £ 

This heap deletion algorithm is also O(log n), since there are loga n + | levels 
in the tree and at most two nodes are accessed at each level. However, the overhead of 
shifting and computing largeson is significant. 


Sorting Using a Heap 


Heapsort is simply an implementation of the general selection sort using the input 
array x as a heap representing a descending priority queue. The preprocessing phase 
creates a heap of size n using the siftup operation. and the selection phase redistributes 
the elements of the heap in order as it ueletes elements from the priority queue using the 
siftdown operation. In both phases the loops need not include the case where i equals 
0. since x[0] is already a one-element priority queue and the array is sorted once x| 1} 
through xin — 1] are in proper position. 


/* Create the priority queue; before each loop iteration */ 
/* the priority queue consists of elements x{0] through */ 
/* x[i - 1). Each iteration adds x[i] to the aueue. */ 
for (i = 1; i< n; i++) 

pginsert(x, i, x{i]); 
/* select each successive element in order */ 

for (i =n-1; i> 0; i--) 
x[i] = pomaxdelete(x, i + 1); 


Figure 6.3.3 illustrates the creation of a heap of size 8 from the original file 
25 57 48 3%, 1292 86.33 


The dotted lines in that figure indicate an element being shifted down the tree. 


Sec. 6.3 Selection and Tree Sorting 359 


Figure 6.3.3. Creating a heap of size 8. 
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(a) Original tree. 


ta) x[5}: = pgmuxaelete (x, 6) ; (e) x{4]: = pqmaxdelete (x, $) 


Figure 6.3.4 Adjusting a heap. 


© 
(9) 09. 
AD) - 
Q 


t 
(f) x{4]: = pqmaxdelete (x, 4) 


(g) x13): = pqmaxdelete (x, 3) (h) xf2]}: = pqmaxdelete (x, 2). The array is sorted. 


Figure 6.3.4 (cont.) 


Heapsort Procedure 
© We now present a heapsort procedure with all subprocedures (pqginsert, pq- 
maxdelete. adjustheap, and largeson) expanded in-line and integrated for maximal 
efficiency. , y 
void heapsort (int x[], int n) 


int i, elt, s, f, ivalue; 
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/* preprocessing phase; create initial heap */ 
for (i = 1;,i <n; i++) { 
elt = x[i]; 
/* pqinsert(x, i, elt) */ 
s = j; 
f = (s-1)/2; 
while (s > 0 & x[f] < elt) { 
x[s}] = x[f]; 
k H 
f = (s-1)/2; 
} /* end while */ 
x{s] = elt; 
3} /* end for */ 
/* selection phase; repeatedly remove x[0), insert it */ 
/* in its proper position and adjust the heap 3y 
for (i = n-1; i >0; i--) { 
/* pamaxdelete(x, i+1) */ 
ivalue = x[i]; 
x{i] = x[0); 
f = 0; 
/* s = largeson (0, i-1) */ 
if (i == 1) 
S$ =-l; 
else ; 
Sais 
if (i > 2:8& x[2] > x[1]) : “i 
$= 2; 
while (s >= 0 && ivalue < x{s)) { 
x[f] = x[s]; 
f =s; 
/* s = largeson(f, i-1) */ 
S = 2*f+1; 
if (s+1 <= i-1 & x[s] < x[s+1]) 
S$ = Stl; 
if (s > i-1) 
s=-l; 
} /* endewhile */ 
x[f] = ivalue; 
} /* end for */ 
} /* end heapsort */ 


To analyze the heapsort, note that a complete binary tree with 7 nodes (where 7 is 
one less than a power of two) has log (n + 1) levels. Thus if each element in the array 
were a leaf, requiring it to be filtered through the entire tree both while creating and 
adjusting the heap, the sort would still be O(ı log n). 

In the average case the heapsort is not as efficient as the quicksort. Experiments 
indicate that heapsort requires twice as much time as quicksort for randomly sorted 
input. However, heapsort is far superior to quicksort‘in the worst case. In fact, heapsort 
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remains O(n log n) in the worst case. Heapsort is also not very efficient for small n be- 
cause of the overhead of initial heap creation and computation of the location of fathers 
and sons. - ` : 

The space requirement for the heapsort (aside from array indices) is only one ad- 
ditional record to hold the temporary for switching, provided the array implementation 
of an almost complete binary tree is used. sa 


EXERCISES 


6.3.1. Explain why the straight selection sort is more efficient than the bubble sort. 

6.3.2. Consider the following quadratic selection sort: Divide the n elements of the file into 
Jn groups of /n elements each. Find the largest element of cach group and insert it 
into an auxiliary array. Find the largest of the elements in this auxiliary array. This is 
the largest element of the file. Then replace this element inthe array by the next largest 

+ element of the group.trom which it came. Again find the largest element of the auxiliary 
array. This is the second largest element of the file. Repeat the process until the file has 
been sorted. Write a C routine to implement a quadratic selection sort as efficiently as 


possible. 
6.3.3. A tournament is an almost-eomplete strictly binary tree in which each nonleaf contains 
the larger of the two elements in it sons. Thus the contents of a tournament's leaves 


completely determine the contenis of all its nodes. A tournament with n leaves represents 

a set of n elements. _ ta ee 

(a) Develop an algorithm pginsert(t, n, elt) to add a new element elr to a tournament 

; containing n leaves represented implicitly by an array r. 

(b) Develop an algorithm pgmaxdelete(t.n) to delete the maximum element trom a 
tournament with n elements by replacing the leaf containing the maximum element 
with a dummy value smaller than any possible element (for example, —1 ina 
tournament of nonnegative integers) and then readjusting all values in the path 
from that leaf to the root. 

(c) 4 Show how to simplify pgmaxdelete-by maintaining a pointer to a leaf in each non- 
leaf info field, rather than an actual element value. . 

(d) Write a C program to implement a selection sort using 2 tournament. The prepro- 

i cessing phase builds the initial tournament from the array x and the selection phase 
applies pymaxdelete repeatedly. Such a sort is called a tournament sort. 

(e) How does the efficiency of the tournament sort compare with that of the heapsort? 

(f) Prove that the tournament sort is O(n log n) for all input. 

` 6.3.4. Define an almost complete ternary tree as a tree in which every node has at most three 
sons, and in which the nodes can be numbered trom 0 ton — 1. so that the sons of node{i) 
are node|3 * i + 1). node[3 * i + 2), and node[3 * i + 3]. Define a ternary heap as an 
almost complete ternary tree in which the content of each node is greater than or equal to 
the contents of all its descendants. Write a sorting routine similar to the heapsort using 

a ternary heup. 

6.3.5. Write a routine combine(x) that accepts an array x in which the subtrees rooted at x{1] 
‘and x{2] are heaps and that modifies the array x so that it represents a single heap. 
6.3.6. Rewrite the program of Section 5.3 that implements the Huffman algorithm so that the 

Set of root nodes forms a priority queue implemented by an ascending heap. 
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6.3.7. Write a C program that uses an ascending heap to.merge n input files, each sorted in 
ascending order, into a single output file. Each node of the heap contains a file number: 
and a value. The value serves as the key by which the heap is organized. Initially, one 

- value is read from each file, and the n values are formed into an ascending heap. with 
the file number from which each value came kept together with that value in a node. 
The smallest value is then in the root of the heap and it is the output, with the next valuc 
of its associated file input to take its place. That value, together with its associated file 
number, is sifted down to find its proper place in the heap, and the new root value is 
output. This process oLoutput/inpu/siftdown is repeated until no input remains. 

6.3.8. Develop an algorithm using a heap of k elements to find the wer k numbers in a Rao 
unsorted file of n numbers. 


6.4 INSERTION SORTS 
Simple Insertion 


An insertion sort is one that sorts a set of reċords by inserting records into an 
existing sorted file. An example of a simple insertion sort is the following procedure: 


void insertsort(int x[], int n) 
{ 
int i, k, y; 


/* initially x[0] may be thought of as a sorted file of */ 
/* one element. After each repetition of the following */ 
/* loop, the elements x[{0] through x[k] are in order, */ 
for (k= 1; k <n; k++) { 

/* Insert x[k] into the sorted file */ 

y = xik]; 

/* Move down 1 position all elements greater than y */ 

for (i = k-1; i >= 0 & y < x[i]; i--) 

x[i+1] = x[i]; 

/* Insert y.at proper position */ 

x[i+1] = y; 
}. /* end for */ : ' 

} /* end insertsort */ 3 


As we noted at the beginning of Section 6.3, the simple insertion sort may be viewed 
as a general selection sort in which the priority queue is implemented as an ordered 
array. Only the preprocessing phase of inserting the elements into the priority queue 
is necessary; once the elements have been inserted, they are maz sorted, so that no 
selection is necessary. 

If the initial file is sorted, only one comparison is made on each pass, so that the 
sort is O(n). If the file is initially sorted in the reverse order, the sort is O(n), since the 
total number of comparisons is ` 


(n= 1) #(n—2)+-°-4+34241 =(n-1)*n/2- 
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which is O(n?), However, the simple insertion sort is still usually better than the bubble 
sort. The closer the file is to sorted order, the more efficient the simple insertion sort-be- 
comes. The average'number of comparisons in the simple insertion sort (by considering 
all possible permutations of the input array) is also O(n?). The space requirements for 
the sort consist of only one temporary variable, y. 

The speed of the sort can be improved somewhat by using a binary search 
(see Sections 3.1, 3.2, and 7.1) to find the Proper position*for x[k] in the sorted file 
x{0},.... xik — 1]. This reduces the total number of comparisons from O(n?) to 
O(n log n). However, even if the correct position i for x[k] is found in O(log n) steps, 
each of the elements x[i + 1],.... x|k — 1] must be moved one position. This latter oper- 
ation performed n times requires O(n?) replacements. Unfortunately, the binary search 
technique does not, therefore, significantly improve the overall time requirements of 
the sort. 

Another improvement to the simple insertion sort can be made by using list inser- 
tion. In this method there is an array link of pointers, one for each of the original array 
elements, Initially link{i] = i + } for0 <= i < n— land link[n — 1] = —1. Thus the 
array may be thought of as a linear list pointed to by.an external pointer first initialized 
to 0. To insert the kth element the linked list is traversed unti] the proper position for 
x{k] is found, or until the end of the list is reached. At that point x[k] can be inserted into 
the list by merely adjusting the list pointers without shifting any elements in the array. 
This reduces the time required for insertion but not the time required for searching for 
the proper position. The space requirements are also increased because of the extra link 
array. The number of comparisons is still O(n), although the number of replacements 
in the link array is O(n). The list insertion sort may be viewed as a general selection 
sort in which the priority queue is represented by an ordered list. Again, no selection is 
needed because the elements are sorted as soon as the preprocessing, insertion phase is 
complete. You are asked to code both the binary insertion sort and the list insertion sort 
as exercises. F 

Both the straight selection sort and the simple insertion sort are more efficient 
than bubble sort. Selection sort requires fewer assignments than insertion sort but more 
comparisons. Thus selection sort is recommended for small files when records are large, 
so the assignment is inexpensive, but keys are simple., so that comparison is cheap. If 
the reverse situation holds. insertion sort is recommended. If the input is initially in a 
linked list, list insertion is recommended even if the records are large, since no data 
movement (as opposed to pointer modification) is required. 

Of course. heapsort and quicksort are both more efficient than insertion or selec- 
tion for large n. The break even point is approximately 20-30 for quicksort; for fewer 
than 30 elements use insertion sort: for more than 30 use quicksort. A useful speedup of 
quicksort uses insertion sort on any subfile of size less than 20. For heapsort, the break 
even point with insertion sort is approximately 60-70. 


Shell Sort 


More significant improvement on simple insertion sort than binary or list insertion 
can be achieved by using the Shell sort (or diminishing increment sort), named after 
its discoverer, This method sorts separate subfiles of the original file. These subfiles 
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contain every kth element of the original file. The value of k is called an increment. 
For example, if k is'5, the subfile consisting of x[0], x[5}, x[10]. ... is first sorted. Five 
subfiles, cach containing one fifth of the elements of the original file are sorted in this 
manner. These are (reading across) 


Subfile 1 -> FO.  x{5]} x710) 
Subfile 2 -> x1] x16) xim) 
Subfile 3 -> x{2] x{7}. — xf12) 
Subfile 4 one ea) x{8) xf 13] 
Subtile 5 -> x[4} x{9] xf 14] 


The ith element of the jth subfile is x{(i - 1)* 5 + j- 1). If a different increment & is 
chosen, the k subfiles are divided so that tne ith element of the jth subfile is x[íi — 1) * 
k+j- Ñ. 

After the first k subfiles are sorted (usually by simple insertion). a new smaller 
value of k is chosen and the file is again partitioned into a new set of subfiles. Each 
of these larger subfiles is sorted and the process is repeated yet again with an even 
smaller value of k. Eventually, the value of k is set to 1 so that the subfile consisting of 
the entire file is sorted. A decreasing sequence of increments is fixed at the start of the 
entire process. The last value in this sequence must bel. 

For example, if the original file is 


25 57 48 37 12 92 86 33 


and the sequence (5,3,1) is chosen, the following subfiles are sorted on each iteration: 
First iteration (increment = 5) 


(x[0], x{5}) 
(x[1], x16]) 
(x[2]. x{7]) 
(x[3]} 
(x[4]) 


Second iteration (increment = 3) 


(x{0}, x{3), x16) 
(xf), x[4], x[7]) 
l2) x15) 


Third iteration (increta = 1) 
(x[0]. xf 1), x12], x{3), x4], x[5]. x16}, x{7}) 


Figure 6.4.1 illustrates the Shell sort on this sample file. The lines underneath 
each array join individual elements of the separate subfiles. Each of the subfiles is sorted 
using the simple insertion sort. 

We present below a routine to implement the Shell sort. In addition to the standard 
parameters x and n. it requires an array incrmnts, containing the diminishing increments 
of the sort. and muminc, the number of elements in the array incrmnts. 
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span = 5 | ; i j 


Pas?) hs i 574 33 3I 12i 92 HB 
span = 3 


enti 9g? ig atza o age 9 88 787 
i span = | 


Sorted 12 725 33°,.. 37 48 57 86 92 
file 


Figure 6.4.1 


void shellsort(int x[], int n, int incrmnts[], int numinc) 


{ 


int incr, j, k, span, y; 


for (incr = 0; incr < numinc; incr++) { 
/* span is the size of the increment */ 
span = incrmnts[incr]; 
for (j = span; j < n; j++) f 
/* Insert element x[j] into its proper */ 
"/* position within its subfile =, */” 
y = x[j]; 
for (k = j-span; k >= 0 & y < x[k]; k -= span) 
x[k+span] = x[k]; * s 
x[k+span] = y; 
}. /* end for */ 
}./* end for */ 
} /* end shellsort */ 


Be sure that you can trace the actions of this program on the sample file of Figure 
6.4.1. Notice that on the last iteration, where span equals 1, the sort reduces to a simple 
insertion. i ' : 
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The idea behind the Shell sort is a simple one. We have already noted that the 
simple insertion sort is highly efficient on a file that is in almost sorted order. It is also 
important to realize that when the file size n is small, an O(n?) sort is often more efficient 
than an O(n log n) sort. The reason for this is that O(n?) sorts are generally quite simple 
to program and involve very few actions other than comparisons and replacements on 
each pass. Because of this low overhead, the constant of proportionality is rather small. 
An O(n log n) sort is generally quite complex and employs a large number of extra 
Operations on each Pass in order to reduce the work of subsequent passes. Thus its 
constant of proportionality is larger. When n is large, n? overwhelms n*log (n); so that 
the constants of Proportionality do not play a major role in determining the faster sort. 
However, when n is small, n? is not much larger than n*Jog (7), so that a large difference 
in those constants often causes an O(n?) sort to be faster. 

Since the first increment used by the Shell sort is large, the individual subfiles are 
quite small, so that the simple insert on sorts on those subfiles are fairly fast. Each sort of 
a subfile causes the entire file to be more nearly sorted. Thus, although successive passes 
of the Shell sort use smaller increments and therefore deal with larger subfiles, those 
Subfiles are almost sorted due tothe actions of previous pas. Thus. the insertion sorts 
on those subfiles are also quite efficient. In this connection, it is significant to note that if 
a file is partially sorted using an increment k and is subsequently partially sorted using 
an increment j, the file remains Partially sorted on the increment k. That is. subsequent 
partial sorts do not disturb earlier ones. 

The efficiency analysis of the Shell Sort is mathematically involved and beyond 
the scope of this book. The actual time requirements for a specific sort depend on the 
number of elements in the array incrmnts and on their actual values. One requirement 
that is intuitively clear is that the elements of incrmnrs should be relatively prime (that 
is, have no common divisors other than 1). This guarantees that successive: iterations 
intermingle subfiles so that the entire file is indeed almost sorted when span equals | 
on the last iteration. 

It has been shown that the order of the Shell sort can be approximated by 
O(n(log n)*) if an appropriate sequence of increments is used. For other series of 
increments, the running time can be proven to be O(n'*), Empirical data indicates that 
the running time is of the form a * n’, where a is between 1.1 and 1.7 and b js approxi- 
mately 1.26, or of the form c * n * (In(n))? — d*n» In(n}, where c is approximately 0.3 
and d is between 1.2 and 1.75. In general the Shell sort is recommended for moderately 
sized files of several hundred elements. 

Knuth recommends choosing increments as follows: define a function h recur- 
sively so that A(1) = 1 and hli +1) = 3* h(i) + 1. Let x be the smallest integer such 
that h(x) = n, and set numinc. the number of increments, to x — 2 and incrmnts|i] to 
A(numine — i + 1) for i from 1 to numinc. 

A technique similar to the Shell sort can also be used to improve the bubble sort. 
In practice, a major source of the bubble Sort’s inefficiency is not the number of com- 
parisons but the number of interchanges. If a series of increments are used to define 


overhead, works well in practical situations. 
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Address Calculation Sort 


Asa final example of sorting by insertion, consider the following technique called 
sorting by address calculation (sometimes called sorting by hashing). In this method 
a function f is applied to each key. The result of this function determines into which 
of several subfiles the record is to be placed. The function should have the property 
that if x < y,f(x) = f(y). Such a function is called order-preserving. Thus all of the 
records in one subfile will have keys that are less than or equal to the keys of the records 
in another subfile. An item is placed into a subfile in correct sequence by using any 
sorting method; simple insertion is often used. ‘After all the items of the original file 
have been placed into subfiles, the subfiles may be concatenated to produce the sorted 
result. 

For example, consider again the sample file 


25, 57, 48.,37..,12,..92,, 86; 33 


Let us create ten subfiles, one for each of the ten possible first digits. Initially, each of 
these subfiles is empty. An array of pointers f[10} is declared, where Ff [i] points to the 
first tlement in the file whose first digit is i. After scanning the first element (25) it is 
placed into the file headed by [2]. Each of the subfiles is maintained as a sorted linked 
list of the original array elements. After processing each of the elements in the original 
file. the subfiles appear as in Figure 6.4.2. 

We present a routine to implement the address calculation sort. The routine as- 
sumes an array of two-digit numbers and uses the first digit of each number to assign 
that number to a subfile. 


#define NUMELTS ... 
addr(int x{], int n) 


int f[10], first, i, je Py Yi 
struct { 

int info; 

int next; 
} node [NUMELTS] ; 


/* Initialize available linked list +y 
int avail = 0; 

for (i = 0; i <n-1; i+) 

a- node[i).next = i+1; 

node[n-1].next = -1; 

/* Initialize pointers */ 

for (i= 0; i< 10; i++) 

tO)» -1 


for (i = 0; i < ny i+) { 


/* We successively insert each element into its */.. 
/* respective subfile using list insertion. *7 
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FO) = null 


Bae 


F(4) 

F(6) = null 

F(7) = null 

ro —~ [a To] 


Figure 6.4.2 Address calculation sort. 


y = x[i]; l 
first = y/10; /* Find the 1st digit of a two digit.number */ 
/* Search the linked list */ 
place (&f[first], y); 
/* place inserts y into its proper position */ 
/* in the linked list pointed to by ee bel A 

} /* end for */ 


/* Copy numbers back into the array x If 
i = 0; 
for (j = 0; j < 10; j++) { 

p = f[j); 


while (p != -1) { 
x[i++] = node[p]. info; 
Pp = node[p].next; 
} /* end while */ 
} /* end for */ 


} /* end addr */ 
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The space requirements of the address calculation sort are approximately 2 * n 
(used by the array node) plus some header nodes and temporary variables. Note that if 
the original data is given in the form of a linked list rather than as a sequential array, it 
is not necessary to maintain both the array x and the linked structure node. 

To evaluate the time requirements for the sort, note the following: If the n origi- 
nal elements are approximately uniformly distributed over the m subfiles and the value 
of n/m is approximately 1, the time of the sort is nearly O(n), since the function as- 
signs each element to its proper file and little extra work is required to place the el- 
ement within the subfile itself. On the other hand, if n/m is much larger than 1, or if 
the original file is net uniformly distributed over the m subfiles, significant work is 
required to insert an’ element into its proper subfile, and the time is therefore closer 
to O(n*). 


EXERCISES 


6.4.1, The two-way insertion sort is a modification of the simple insertion sort as follows: A 
separate output array of size n is set aside. This output array acts as a circular structure 
as in Section 4.1. x[0] is placed into the middle element of the array. Once a contiguous 
group of elements are in the array, room for a new element is made by shifting all smaller 
elements one step to the left or all larger elements one step to the right. The choice of 
which shift to perform depends on which would cause the smallest amount of shifting. 
Write a C routine to implement this technique. 


6.4.2, The merge insertion sort proceeds as follows: 


Step 1: For all even i between 0 and n — 2, compare x[i] with x[i + 1]. Place the larger in 
the next position of an array large and the smaller in the next position of an array small. 
If n is odd, place x[n — 1] in the last position of the array small. (Large is of size ind, 
where ind = (n — 1)/2; small is of size ind or ind + 1, depending on whether n is even 
or odd.) i 

Step 2: Sort the array /arge using merge insertion recursively. Whenever an element 
large{j] is moved to large{k], small{j] is also moved to small[k). (At the end of this step, 
large{i] <= large{i + 1] for all i less than ind. and small{i] <= large[i)-for all i less 
than or equal to ind. 

Step 3: Copy small{0] and all the elements of large into x{0] through x{ind]. 

Step 4: Define the integer num{i) as (2'*' + (—1)')/3. Beginning with i = 0 and pro- 
ceeding by 1 while num[i] <= (n/2) + 1, insert the elements small{num[i + 1}] down 
to small{num{i] + 1] into x in turn, using binary insertion. (For example, if n = 20, 
the successive values of num are num[O}] = 1, num[{}] = 1, num[2] = 3, num[3] = 5, 
and num[4] = 11, which equals (n/2) + 1. Thus the elements of small are inserted in 
the following order: smal/(2], small{1); then small/(4}, small{3); then smali(9), smaill[8), 
small{7}, small[6], small{5]. In this example, there is no small{10}.) 


Write a C routine to implement this technique. 


6.4.3. Modify the quicksort of Section 6.2 so that it uses a simple insertion sort when a subfile 


is below some size s. Determine by experiments what value of s should be used for 
maximum efficiency. 
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644. Prove that if a file is partially sorted using an increment j in the Shell sort, it remains 


partially sorted on that increment even after it is partially sored on another incre- 
mem, k. 


s 
6.4.5. Explain why it is desirable to chouse all the increments of the Shell sort so that they are 
relatively prime. 


6.4.6. What is the number of comparisons and interchanges (in terms of file size n) performed 
by each of the following sorting methods (a-j) for the following files: 


1. 
2. 
3. 


(a) 
(b) 
(c) 
(d) 
(e) 
(f) 
t) 
(h) 
(i) 
Q) 


A sorted file 

A file that is sorted in reverse order (that is, from largest to smallest) 

A file in which the elements x{0], x[2], x{4], ... are the smallest elements and are in 
sorted order, and in which the elements x{1}, x[3], x[5],... are the largest elements 
and are in reverse sorted order (that is, x[0] is the smallest, «{1] is the largest, a{2J 
is next to smallest, [3] is the next to the largest, and so on) 

A file in which x{0] through x[ind] (where ind = (n — 1)/2) are the smallest el- 


_ ements and are sorted, and in which x{ind + 1] through xin — 1) are the largest 


elements and are in reverse soried order 


A file in which x{0}, x{2], x[4], ... are the smallest elements in sorted order, and in 
which x{1), x{3}, x[5],...are the largest elements in sorted order 


Simple insertion sort 

Insertion sort using a binary search 

List insertion sort 

Two-way insertion sort of Exercise 6.4.1 
Merge insertion sort of Exercise 6.4.2 

Shell sort using increments 2 and | 

Shell sort using increments 3, 2, and 1 

Sheil sort using increments 8, 4, 2, and 1 
Shell sort using Increments 7, 5, 3, and 1 
Address calculation sort presented in the text 


6.4.7. Under what circumstances would you recommend ‘the use of each of the following sorts 
over the others? 


(a) 
(b) 
(c) 


Shell sort of this section 
Heapsort of Section 6.3 
Quicksort of Section 6.2 


6.4.8. Determine which of the following sorts is most efficient. 


(a) 
(b) 
(c) 


Simple insertion sort of this section 
Straight selection sort of Section 6.3 
Bubble sor of Section 6.2 


6.5 MERGE AND RADIX SORTS 


Merge Sorts 


a” 


Merging is the process of combining two or more sorted files into a third sorted 
file. An example of a routine thataccepts two sorted arrays u and b of nl and n2 ele- 
ments, respectively, and merges them into a third array c containing n3 elements is the 


following: 
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void mergearr(int a[], int-b(), int c[], int nl; int n2, int n3) 
{ 2 ; 

int apoint, bpoint, cpoint; 

int alimit, blimit, climit; 


alimit = nl-1; 
blimit = n2-1; 
climit = 'n3~-1; 
if (nl + n2 != n3) { 
printf("array bounds incompatible/n"); 
exit(1); 
}-/* end if */ 
/* apoint and bpoint are indicators of how far */ 
/* we are in’arrays a and b respectively. af 
apoint = 0; 
bpoint = 0; 
for (cpoint = 0; apoint <= alinit && bpoint <= blimit; cpoint++) 
if (alapoint] < b[bpoint}) 
c{cpoint] = a({apoint++]; 


else 
c[cpoint] = b(bpoint++]; 
5 while (apoint <= alimit) 


c[cpoint++] = a[apoint++]; 
while (bpoint <= blimit) 
c{cpoint++] = b[bpoint++]); 
} /* end mergearr */ 


We can use this technique to sort a file in the following way. Divide the file into 
n subfiles of size 1 and merge adjacent (disjoint) pairs of files. We then have approxi- 
mately n/2 files of size 2. Repeat this process until there is only one file remaining of size 
n. Figure 6.5.1 illustrates how this process operates on a sample file. Each individual 
file is contained in brackets. 

We present a routine to implement the foregoing description of a straight merge 
sort. An auxiliary array aux of size n is required to hold the results of merging two 
subarrays of x. The variable size contains the size of the subarrays being merged. Since 
at any time the two files being merged are both subarrays of x, lower and upper bounds 
are required to indicate the subfiles of x being merged. /] and ul represent the lower 
and upper bounds of the first file, and /2 and u2 represent the lower and upper bounds 
of the second file, respectively. i and j are used to reference elements of the source files 
being merged, and k indexes the destination file aux. The routiné follows: ©” 


#define NUMELTS ... 
void mergesort(int x[], ‘int ñ) 
Wy 


«5 Int, aux[NUMELTS]; i, j, k, 11,112, :size;púl,; -u2; 
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Figure 6.5.1 Successive passes.of the merge sort. 
t 
size = 1;)\/* iMerge files of size p Ey 77u 45 OP 


while (size < n) { aT 
Al 3,0; /> Initialize» lower bounds of Fani fi Te wj 
k ra PT. ft kyis index for;auxiliaryarray.) (47 © 
Ardi Cissize <n)f, /* Check to see.if there! */) i 
/*are two files aink Khe 
/* Compute remaining. indices “y, om 9 i 
12 = Ti+size; 
ul = 12-1; j 
u2? = (12+size-1 < n) ?'12+size-1 : 
7* Proceed through ‘the two” aibHies'« a 
for (i= 11, j= 12; ee ul Gb j e u2; +35 
/* Enter! smaNer- “into ed array aux Mi as 
olf (xfi] <= x)» E i 
aux[k]: = nies]: i ON BIL 
nelse, ? x 
ieman aux). [jel idnil rok 
Je At this point, .one. of the ‘subfiles Kf}: 
/* has been exhausted. Insert any. f/ 
i* remaining portions of the other file */ 
for (; i <= ul; k++) 
: aux(k] = x(i++]; ngs 
xo “for (5 <= u; Kay!” i i. 
! aux[k] = x[j+)]; 
/* Advance 11 to the Start of the next sgh ‘of files. j 
Vos u2+1; 
a }/*. end while */ 


new 
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/* Copy any remaining single file */ 
for (i = 11; k < n; i++) 
aux(k++] = x[i]; 
/* Copy aux into x and adjust size */ 
for (i = 0; i <n; i++) 
x[i] = aux[i]; 
size *= 2; 
} /* end while */ 
} /* end mergesort */ 


There is one deficiency in the foregoing procedure that is easily remedied if the 
program is to be practical for sorting large arrays. Instead of merging each set of files 
into the auxiliary array aux and then recopying the array aux into x, alternate merges 
can be performed from x to aux and from aux to x. We leave this modification as an 
exercise for the reader, 

There are obviously no more than log? n passes in merge sort, each involving n or 
fewer comparisons. Thus, mergesort requires no more than n = log2 n comparisons. In 
fact, it can be shown that mergesort requires fewer than n * logs m— n+ l comparisons, 
on the average, compared with 1.386 * n * log. n average comparisons for quicksort. In 
addition, quicksort can require O(n?) comparisons in the worst case, whereas mergesort 
never réquires more than n * logy n. However, mergesort does require approximately 
twice as many assignments as quicksort on the average, even if allernating inerges go 
from x to aux and from aux to x. 

Mergesort also requires O(n) additional space for the auxiliary array, whereas 
quicksort requires only O(log n) additional space for the stack. An algorithm has been 
developed for an in-place merge of two sorted subarrays in O(n) time. This algorithm 
would allow mergesort to become an in-place O(n log n) sort. However. that technique 
does require a great deal many more assignments and would thus not de as practical as 
finding the O(n) extra space. 

There are two modifications of the foregoing procedure that can result in more ef- 
ficient sorting. The first of these is the natural merge. In the straight merge. the files are 

„all the same size (except perhaps for the last file). We can, however, exploit any order 
that may already exist among the elements and let the subfiles be defined as the longest 
subarrays of increasing elements. You are asked to code such a routine as an exercise. 

The second: modification uses linked allocation instead of sequential allocation. 

- By adding a single pointer field to each record, the need for the second array aux can be 
eliminated. This can be done by explicitly linking together each input and output sub- 
file. The modification can be applied to both the straight merge and the natural merge. 
You are asked to implement these in the exercises. 

Note that using mergesort on a linked list eliminates both of its drawbacks relative 
to quicksort: lt no longer requires significant additional space and does not require sig- 
nificant data element movement. Generally, data elements can be large and coniplex, so 
that assignment of data elements requires more work than the reassignment of pointers 
that is still required by a list-based mergesort. . 

Mergesost can also be presented quite naturally as a recursive process in which 
the two halves of the array are first recursively sorted using mergesort and, once sorted, 
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are joined by merging. For details, see Exercises 6.5.1 and 6.5.2. Both mergesort and 
quicksort are methods that involve splitting the file into two parts, sorting the two parts 
separately, and then joining the two sorted halves together. In mergesort, the splitting is 
trivial (simply taking two haives) and the joining is hard (merging the two sorted files). 
In quicksort, the splitting is hard (partitioning) and the joining is trivial (the two halves 
and the pivot automatically form a sorted array). 

Insertion sort may be considered a special case of mergesort in which the two 
halves consist of a single element and the remainder of the array. Selection sort may 
be considered a special case of quicksort in which the file is partitioned into one half 


consisting of the largest element alone and a second half consisting of the remainder of 
the array. 


The Cook-Kim Algorithm 


: Frequently, it is known that a file is almost sorted with only a few elements out of 
order. Or it may be known that an input file is likely to be sorted. For small files that are 
very nearly sorted or for sorted files, simple insertion is the fastest sort (considering both 
comparisons and assignments) that we have encountered. For large files or files that are 
slightly Jess sorted, quicksort using the middle element as pivot is fastest. (Considering 
only comparisons, mergesort is fastest.) However, another hybrid algorithm discovered 
by Cook and Kim is faster “han both insertion sort and middle-element quicksort for 
nearly sorted input. ; i 

The Cook-Kim algorithm operates as follows: The input is examined for un- 
ordered pairs of elements (for example, x[k] > x{k + 1]). The two elements in an 
unordered pair are removed and added to the end of a new array. The next pair ex- 
amined after an unordered pair is removed consists of the predecessor and successor 
of the removed pair. The original array, with the unordered pairs removed, is now in 
sorted order. The array of unordered pairs is then sorted using middle-element quicksort 
if it contains more than 30 elements, and simple insertion otherwise. The two arrays are 
then merged. ' 

The Cook—Kim algorithm takes more advantage of the sortedness of the input than 
any other sorts and is significantly better than middle-element quicksort, insertion sort, 
merge sort, or Bsort on nearly sorted input. However, for randomly ordered input, Cook- 
Kim is less efficient than Bsort (and certainly than quicksort or merge sort). Middle- - 
element quicksort, merge sort, or Bsort is therefore preferable when large sorted input 
files are likely but good random-input behavior is also required. 


: Radix Sort 


The next sorting method that we consider is called the radix sort. This sort is 
based on the values of the actual digits in the positional representations of the numbers 
being sorted. For example, the number 235 in decimal notation is written with a 2 in the 
hundreds position, a 3 in the tens pasition, and a 5 in the units position. The larger of two 
such integers of equal length can be determined as follows: Start at the most-significant 
digit and advance through the least-significant digits as long as the corresponding digits 
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in the two numbers match. The number with the larger digit in the first position in which 
the digits of the two numbers do not match is the larger of the two numbers, Of course, 
if all the digits of both numbers match, the numbers are equal. - 

~ We can write a sorting routine based on the foregoing method, Using the decimal 
base, for example, the numbers can be partitioned into ten groups based on their most- 
significant digit. (For simplicity, we assume that all the numbers have the same number 
of digits, by padding with leading zeros, if necessary.) Thus every element, in the “0” 
group is less than every element in the “1” group, all of whose elements are less than 
every element in the “2” group, and so on. We can then sort within the individual groups 
based on the next significant digit. We repeat this process until each subgroup has been 
subdivided so that the least-significant digits are sorted. At this point the origina! file 
has been sorted. (Note that the division of a subfile into groups with the same digit in a 
given position is similar to the partition operation in‘the quicksort, in which a subfile is 
divided into two groups based on comparison with a particular element.) This method 
is sometimes called the radix-exchange sort, its coding is left as an exercise for the 
reader. 

Let us now consider an alternative to the foregoing method. It is apparent from 
the foregoing discussion that considerable bookkeeping is involved in constantly sub- 
dividing files and distributing their contents into subfiles based on particular digits. It 
would certainly be easier if we could process the entire file as a Whole rather than deal 
with many individual files. : 

Suppose’ that we perform the following actions ön the file for each digit, beginning 
with the Jeast-significant digit and ending with the most-significant digit. Take each 
number in the order in. which it appears in the file and place it into one of ten queues, 
depending on the value of the digit currently being processed. Then restore each queue 
to the original file starting with the queue of numbers with a 0 digit and ending with 
the queue of numbers with a 9 digit When these actions have been performed for each 
digit, starting with the least significant and‘ending with the most significant, the file is 
sorted. This sorting method is called the radix sort. — ; 

Notice that this scheme sorts on the less-significant digits first. Thus when all the 
numbers are sorted on a more significant digit, numbers that have the same digit in that 
position but different digits in a less-significant position are already sorted on the less- 
significant position. This allows processing of the entire file without subdividing the 
files and keeping track of Where each subfile begins and ends. Figure 6.5.2 illustrates 
this sort on the saniple file > . ; 


255-97), 48 alur de uI- 


Be sure that you can follow the actions depicted in the two passes of Figure 6.5.2. 
We can therefore outline an algorithm to sort in the foregoing fashion as follows: 


for (k = least significant digit; k <= most significant digit; k++) { 
for (i. = 03,7. < ny. i++) { 
y = x[i]; 
j.= kth digit of y; 
place y at rear of queve[j]; 
} /* end for */ 
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Original file 
25 57 48 37 12 92 Ro 33 
Queues based on least significant digit. 


Front Rear 
queue [0] 
queue {1} 
queue (2) 12 92 
queue |3] 33 
queue [4] 
queue [5] 25 
queue |6) 86 
queue {7} 57 37 
queue [8] 48 
queue (9) 
After first pass: 


12 92 33 25 86 57 37 48 


Queues based on most significant digit. 


Front Rear À 
queue [O) 
queue |1) 12 
queue |2] 25 
queuc (3} 33 37 
queue [4] 48 
queue [5] 57 
queue (6) ` 
queue {7} 
queue [8] 86 i ; 
queue [9] 92 v 


Sorted file: 12 25 33 37 48 57 86 92 


Figure 6.5.2 Illustration of the radix sort. 


for (qu = 0; qu < 10; qu++) 
place elements of queue[qu] in next sequential position of x; 
} /* end for */ 


We now present a program to implement the foregoing sort on m-digit numbers. 
“In order to save a considerable amount of work in processing the queues (especially in 
the step where we return the queue elements to the original file) we write the program 
using linked allocation. If the initial input to the routine is an array, that input is first 
converted into a linear linked list; if the original input is already in linked format, this 
step is not necessary and, in fact, space is saved. This is the same Sitation as in the 
routine addr (address calculation sort) of Section 6.4. As in previous programs, we do 
not make any internal calls to routines but rather perform their actions in place. 
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#define NUMELTS ... 


void radixsort(int x[], int n) 
{ 
int front(10], rear{10}; 
struct { 
int info; 
int next; 
} node [NUMELTS]; 
int exp, first, i, j, k, p,q. ys 
/* Initialize linked list */ 
for (i = 0; i < n-1; i++) { 
node[i].info = x[i]; 
node[i].next = i+1; 
} /* end for */ 
node(n-1].info = x[n-1); 
node{n-1].next = -1; , 
+ first = 0; /* first is the head of tne tinked list */ 
for (k = 1; k < 5; k++) { 
/* Assume we have four-digit numbers */ 
for (i = 0; i < 10; i++) { 
/* Initialize queues */ 
rear{i] = -1; 
front{i] = -1; 
} /* end for */ 
/* Process each element on the list */ 
while (first != -1) { © 
p = first; 
first = node[first].next; 
y = node[p}. info; 
/* Extract the kth digit */ 
exp = power(10,k-1); /* raise 10 to (k-1) th power */ 
j = (y/exp)%10; 
/* Insert y into queue[j] */ 
q = rear[j); 
if (q = -1) 
front[j] = p; 
else ; 
node[q].next = p; 
rear[j] = p; 
} /* end while */ 
* form a single jist from all the queue elements. Find the first element. 
or (j = 0; j < 10 & front[j] == -1; j++) 


first = front[j]; 
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/* At this point each record is in its proper queue based on digit k. We now */ 


/* Link up remaining queues */ 
while (j <= 9) {  /* Check if finished */ 
/* Find the next element */ 
for (i = j+1; i x 10 & front[i] == -1; i++) 


if (i <= 9) { 

p=1; 

node(rear{j]].next = front[i]; 
} /* end if */ 


per 
} /* end while */ 
node{rear(p]}.next = -1; 
} /* end for */ 
/* Copy back to original array */ 
for (i = 0; i < n; i++) { 
x[i] = node[first].info; 
first = node[first] .next; 
} /* end for */ 
} /* end radixsort */ 


The time requirements for the radix sorting method clearly depend on the 
number of digits (m) and the number of elements in the file (n). Since the outer 
loop for (k = l; k <= m: k++) is traversed m times (¿nce for each digit) and 
the inner loop 7 times (once for each element in the file), the sort is approximately 
O(m * n). Thus the sort is reasonably efficient if the number of digits in the keys is not 
too large. It should be noted, however that many machines have the hardware facilities 
to order digits of a number (particularly if they are in binary) much more rapidly than 
they can execute a compare of two full keys. Therefore it is not reasonable to com- 
pare the O(m * n) estimate with some of the other results we arrived at in this chapter. 
Note also that if the keys are dense (that is, if almost every number that can possi- 
bly be a key is actually a key), m approximates log n, so that O(m * n) approximates 
O(n logn). The sort does require space to store pointers to the fronts and rears of the queues 
in addition to an extra field in each record to be used as a pointer in the linked lists. If the 
number of digits is large it is sometimes more efficient to sort the file by first applying the 
radix sort to the most-significant digits and then using straight insertion on the rearranged 
file. In cases where most of the records in the file have differing most-significant digits, 
this process eliminates wasteful passes on the least-significant digits. 


=XERCISES 


6.5.1. Write an algorithm tor a routine merge(x, lb1, ub, ub2) that assumes that.a[/b}] through . 
x{ub1] and x{ubl.+ 1] through x{ub2] are sorted and merges the two into a{/b1] through 
xf{ub2}. ; 

6.5.2. Consider the following recursive version of the merge sort that uses the routine merge 
of Exercise 6.5.1. It is initially called by msort2(x, 0, n — 1). Rewrite the routine by 


eliminating recursion and simplifying. How does the resulting routine differ from the 
one in the text? 
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6.5.3. 


6.5.4, 


6.5.5. 


6.5.6, 


382 


void msort2(int x[]; int 1b, int ub) 
{ 


if (1b != ub) { 
mid = (ub+lb)/2; 
msort2(x, 1b, mid); 
msort2(x, mid+1, ub); 
merge(x, 1b, mid, ub); 
t /* end if */ 
} /* end msort2 */ 


Let a(/1, 12) be the average number of comparisons necessary to merge two sorted arrays 
of length /} and /2. respectively, where the elements of the arrays are chosen at random 
from among /1 + /2 elements. 

(a) What are the values of a(l 1,0) and a(0./2)? 

(b) Show that for JI > O and 12 > 0, alll, 12) is equal to (IATL + 12))* 
Q + al = 1. Rp + (PAN + Ry + alid — 1). (Hint: Express the 
average number of comparisons.in terms of the average number of comparisons 
after the first comparison.) 

(c) Show that a(/1 2) equals (1112x011 + 12+ 2) + 142 + 1)). 

(d) Verify the formula in part ¢ for two arrays, one of size 2 and one of size 1. 

Consider the following method of merging two arrays a and b into c: Perform a binary 

search for }[0] in the array a. If b[0] is between ali] and'afi + 1}, output al 1) through 

a{i} to the array o, then output b[0] to the array c. Next perform a binary search tor b[1] 

in the subarray‘a{i + 1] to alla} (where la is the number of elements in the array a) and 

repeat the output process. Repeat this procedure for every element of the array b. 

(a) , Write a C routine to implement this method. 

(b) In which cases is this method more efficient than the method of the text? In which 
cases is it less efficient? 

Consider the following method (called binary merging) of merging two sorted arrays 
a and b into c: Let lu and Ib be the number of elements of a and b. respectively, and 
assume that la >= łb. Divide a into lb + | approximately equal subarrays. Compare 
b{0] with the smallest element of the second subarray of a. If b[0] is smaller, find afi] 
such that afi] <= b[0)] <= afi + 1] by a binary search in the first subarray. Output 
all elements of the first subarray up to and including afi) into c. and then outprt 4'™ 
into c. Repeat this process with b{1]. b[2],.... bU). where b[j] is found to be larger tian 
the smallest element of the second subarray. Output all remaining elements of the first 
subarray and the first element of the second subarray into c. Then compare b{j} with the 
smallest element of the third subarray of a, and so on. 

(a) Write a program to implement the binary merge. 

(b) Show that if la = lb, thé binary merge acts like the merge described in the text. 

(c) Show that if /b = 1, the binary merge acts like the merge of the previous exercise. 

Determine the number of comparisons (as a function of n and m) that are performed in 

merging two ordered files a and b of sizes n and m, respectively, by exch of the following 

merge methods, on each of the following sets of ordered files: 
Merge Methods: 
(a). the merge method presented in the text 
(b) , the merge of Exervise 6.5.4 
tc) the binary merge of Exercise 6.5.5 


Sorting Chap. 6 


Sets of Files: 
Laman andali] < bli) < ali + 1) for alli 
(b) m= nand ajn) < bil nunan naonaga EEA 
(ce) m = n and a[n/2) < b[1) < bim) < a{(n/2) + 1] 
(d) n = 2*mandali}< bli] < ali + 1] for all i between O and m — | 
(e) n= 2*mandalm+ i) < bli] < aļm+ i+ 1] for all i between 0 and m 
(f) n= 2»m andaf? *i] < bli] < al2*i+ 1! for all i between O and m = 1 r 
(g) m= 1 and b[0] = ain; 2) $ 
(h) m = Land b[0] < al0) | 


i Ai) ITs beh < bhil ; 
6.5.7. cat eae iè Maire 100 and merge them by cach of the methods 
H the previous exercise, keeping track of the number of comparison, made. Do the ok 
for two files of size 10 and two files of size 1000. Repeat the experiment ten times. What 
do the results indicate about the average efficiency of the merge methods? 
Write a routine that sorts a file by first applying the radix sort to the most significant r 
digits (where ris a given constant) and then uses straight insertion to sort the entire file. 
This eliminates excessive passes on low-order digits that may not be necessary. 4 

_ Write a program that prints all sets of six positive integers al, a2. a3, a4, a5, and a6 
SOR ARP TNT eee oe re enamine srna m aas ‘ 


al <= a2 <= a3 <= 20 
al < a4 <= ai <= a6 <= 20 
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and the sum of the squares of al. a2, and a3 equals the sum of the squares of a4, a5, 
and a6, (Hint: Generate all possible sums of three squares, and use a sorting procedure 
to find duplicates.) 
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In this chapter we consider methods of searching large amounts of data to find one par- 
ticular piece of information. As we shall see, certain methods of organizing data make 
the search process more efficient. Since searching is such a common task in computing. 
a knowledge of these methods goes a long way towa:d making a good programmer. 


7.1 BASIC SEARCH TECHNIQUES 


Before we consider specific search techniques, let us define some terms. A table or a file 
is a group of elements, each of which is called a record. Associated with each record is 
a key, which is used to differentiate among different records. The association between a 
record and its key may be simple or complex. In the simplest form, the key is contained 
within the record at a specific offset from the start of the record. Such a key is called 
an internal key or an embedded key. In other cases there is a separate table of keys that 
includes pointers to the records. Such keys are called external. 

For every file there is at least one set of keys (possibly more) that is unique 
(that is, no two records have the same key value). Such a key is called a primary key. 
For example, if the file is stored as an array, the index within the array of an element 
is a unique external key for that element. However, since any field of a record can 
serve as the key in a particular appiication, keys need not always be unique. For exam- 
ple, in a file of names and addressés, if the state is used as the key for a particular search, 
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it will probably not be unique, since there may be two records with the same state 
in the file. Such a key is called a secondary key. Some of the algorithms we present 
assume unique keys; others allow for duplicate keys. When adopting an algorithm for 
a particular application the programmer should know whether the keys are unique and 
make sure that the algorithm selected is appropriate. 

A search algorithm is an algorithm that accepts an argument a and tries to find a 
record whose key is a. The algorithm may return the entire record or, more cominonly, it 
may return a pointer to that record. It is possible that the search for a particular argument 
in a tabie is unsuccessful; that is, there is no record in the table with that argument as 
its key. In such a case the algorithm may return a special “null record” or a nul! pointer. 
Very often, if a search is unsuccessful it may be desirable to add a new record with 
the argument as its key. An algorithm that does this is called a search and insertion 
algorithm. A successful search is often called a retrieval. A table of records in which a 
key is used for retrieval is often called a search table or a dictionary. 

In some cases it is desirable to insert a record with a primary key key into a file 
without first searching for another record with the same key. Such a situation could 
arise if it has already been determined that no such record already exists in the file. 

. In subsequent discussions we investigate and comment upon the relative efficiency of 
various algorithms. In such cases the reader should note whether the comments refer to 
a search, to an insertion, or to a search and insertion. 

Note that we have said nothing about the manner in which the table or file is or- 
ganized. It may be an array of records, a linked list, a tree, or even a graph. Because 
different search techniques may be suitable for different table organizations, a table is 
often désigned with a specific search technique in mind. The table may be contained 
completely in memory, completely in auxiliary storage, or it may be divided between 
the two. Clearly, different search techniques are necessary under these different as- 
sumptions. Searches in which the entire table is constantly in main memory are called 
internal searches, whereas those in which most of the table is kept in auxiliary stor- 
age are called external searches. As with sorting, we concentrate primarily on internal 
searching; however, we mention some techniques of external searching when they re- 
late closely to the methods we study. 


Dictionary as an Abstract Data Type 


A search table or a dictionary can be presented as an abstract data type. We first 
assume two type declarations of the key and record types and a function that extracts 


the key of a record from the record. We also define a null record to represent a failed 
search. 


typedef KEYTYPE ... /* a type of key #/ 
typedef RECTYPE ... /* a type of record */ 
RECTYPE nulirec=... /* a “null” record ey 
KEYTYPE — keyfunct(r) 

RECTYPE r 

Fios 

}; 
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We may then represent the abstract data type table as simply a set of records. 
This is our first example of an ADT defined in terms of a set rather than a sequence. 
We use the notation [eltype] to denote a set of objects of the type eltype. The function 
inset(s, elt) returns true if elt is in set s and false otherwise. The set operation x - y 
denotes the set x with all elements of set y removed. 


abstract typedef [rectype] TABLE (RECTYPE); 


abstract member(tb!,k) 
TABLESRECTYPE) tbl; 
KEYTYPE k; 
postcondition if (there exists an r in th? such that 
keyfunct(r) == k) 
then member = TRUE 
else member = FALSE 


abstract RECTY°E search( th]. k) 
, TASLECRECTYPE) tbl; 
KEYTYPE k; 
postcondition ‘not member( thl, kj) && ferarch == nullrec) 
it (member. tak: && kevtunce(search) == k); 
austract insert! tp?,r) 
F CRECTYFO tei: 
nF r 
precondition mesbent enl xoy 
postcondition isser tbi, r); 
(tb? -f 


abstract deleta: tb}, K) 

TABLECRECTYPE) toi; 

KESTYPC k; 

postcondition tb? == (tbi' - {search(cbi,k}}); 


Because no relation is presumed to exist among the records or their associated 
keys. the table that we have specified is called an unordered table. Although such a 
table allows elements ube retrieved based on their key values, the clements cannot he 
retrieved in a specific order. There are times when. in addition to the facilities provided 
by an unordered table. it is also necessary to retrieve elements based on some ordering 
of their keys, Once an ordering among the records is established. it becomes possible to 
refei to the first element of a table. the last clement cfa table. and the successor of ‘given 
clement. A table that supports these additional facilities is called an ordered table. Vhe 
ADT for an ordered table must be specified as a sequence to indicate the ordering of the 
records rather than as a set. We leave the ADT specification as an exercise for the reader. 


Algorithmic Notation 


Most of the techniques presented t this chapter are presented as algorithms rathe: 
than ts C programs. The reason for this is that a table may be represented in a wide 
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variety of ways, For example, a table (keys plus records) organized as an array might 
be declared by 


édefine TABLESIZF 1000 
typedef KEYTYPE ... 
typedef RECTYPE _. 
struct í 

KEYTYPE k; 

RECTYPE r; : 
z tablelTABLESIz6}; 


or it might de declared as two separate arrays: 


KEY 


REX 


kf SABLESIZE); 
YPE riTALLESIZEJ; 


In the first case the ak key would be referenced as tern. 
Similarly. tora tebie organized as 


eG as 


eP i the second case. as hi} 
a fist. enther the array representation of a hist 
or the dynamic representation of a list could be used. In the former cose t. kev of the 
record pointed to by a pointer p would be referenced as neds ipik: in the Ler case, as 
p=- k. 

However, the techniques for searchine these tables areovers similar Thus. in 
order to free ourselves from the necessiy ci choosing a speentic repress i 
adopt the wigerithmic convention of referencing the jth kev as iut und 
record printed te by p as kup. Simil 


On. We 


the kev ef the 
amy. we reference the correspondme record as 
flor rip} in this wes we can focus on details of technique rther than on details of 
implementation. 


Sequential Searching 


The simpiest form of a scarch is the sequential search, Ths search is uppticable 
to a table organized either as en arra v Or as a tinked list. Ler us asme that & is an 
array of n keys. AG) through Ao = py andr an aray ef recom! n through rin — 1), 
such that AU) is the kev of rG. (Note that we are Using the gigerummic nataten, Aii) 
and ri). as described previously. i Let us also assume that kev isi s argument. We 
wish to return the smallest integer i such that Lid: equals key if such an J exists and — 1 
otherwise. The algorithm for doing this is as follows: 


ty 
n 


for (ï +0; i< nm; ise) 
if (key == k(i)) 
return{i); 
return (-1); 


The algorithm examin>s cach key in ture: upon tinding one that matches the search 
argument, its index (° h acisas pointer to ats records is returned. If no match is 
found, — } is returned. 

This algorithm can be moditied easily to add a record ree with key Key to the table 
if key is not already there. The lust statement is modified to read 
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k(n) = key; /* insert the new key and */ 
r(n) = rec; is record +/ 
n+; /* increase the table size */ 
return(n - 1); 


Note that if insertions are made using the foregoing revised algorithm only, no 
two records can have the same key. When this algorithm is implemented in C, we must 
ensure that incrementing n does not make its value go beyond the upper bound of the 
array. To use a sequential insertion search on an array, sufficient storage must have been 
previously allocated for the array. 

An even more efficient search method involves inserting the argument key at the 


end of the array before beginning the search, thus guaranteeing that the key will be 
found. 


k(n) = key; 
for (i = 0; key != k(i); i++) 


» if (i <n) 
return(i); 
else 
return(-1); 


For a search and insertion, the entire if statement is replaced by 


if (i == n) 
r(n++) = rec; 
return(i); 


The extra key inserted at the end of the array is called a sentinel. 

Storing a table as a linked list has the advantage that the size of the table can be 
increased dynamically as needed. Let us assume that the table is organized as a linear 
linked list pointed to by table and linked by a pointer field next. Then assuming k, r, 
key, and rec as before, the sequential insertion search for a linked list may be written 
as follows: 


q = null; 

for (p = table; p != null && k(p) != key; p = next(p)) 
q= pi 

if (p != null) /* this means that k(p) == key */ 
return (p); 


/* insert a new node */ 
s = getnode(); 
k(s) = key; 
r(s) = rec; 
next(s) = null; 
if (q == null) 
table = s; 
else 
next(q) = 5; 
return(s); 
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The efficiency of searching a list can be improved by the same technique jus. 
suggested for an array. A sentinel node containing the argument key can be added to: 
the end of the list before beginning the search so that the condition in the for loop is the 
simple condition k(p) != key. The sentinel method, however, requires maintaining an 
additional external pointer to the last node in the list. We leave additional details (as, 
for example, what happens to the newly added node when the key is found within the 
list) to the reader. 

Deleting a record from a table stored as an unordered array is implemented by 
replacing the record to be deleted with the last record in the array and reducing the 
table size by 1. If the array is ordered in some way (even if the ordering is not by key), 
this method cannot be used, and half the elements in the array must be moved on the 
average. (Why?) If the table is stored as a linked list, it is quite efficient to delete an 
element regardless of the ordering. 


Efficiency of Sequential Searching 


How efficient is a sequential search? Let us examine the number of comparisons 
made by a sequential search in searching for a given key. We assume no insertions or 
deletions, so that we are searching through a table of constant size n. The number of 
comparisons depends on where the record with the argument key appears in the table. 
If the record is the first one in the table, only one comparison is performed; if the record 
is the last one in the table, » comparisons are necessary. If it is equally likely for the 
argument to appear at any given table position, a successful search will take (on the 
average) (n + 1)/2 comparisons, and an unsuccessful search will take n comparisons. 
In any case, the number of comparisons is O(n). 

However, it is usually the case that some arguments are presented to the search 
algorithm more often than others. For example, in the files of a college registrar, the 
records of a senior who is applying for transcripts for graduate school, or of a freshman 
whose high school average is being updated, are more likely to be called for than those 
of the average sophomore and junior. Similarly, the records of scofflaws and tax cheats 
are more likely to be retrieved from the files of a motor vehicles bureau or the Internal 
Revenue Service than those of a law-abiding citizen. (As we shall see later in this chap- 
ter, these examples are unrealistic because it is unlikely that a sequential search would 
be used for such large files; but for the moment, let us assume that a sequential search 
is being used.) Then if frequently accessed records are placed at the beginning of the 
file, the average number of comparisons is sharply reduced, since the most commonly 
accessed records take the least amount of time to retrieve. 

Let p(i) be the probability that record i is retrieved. (p(i) is a number between 0 
and 1 such that if m retrievals are made from the file, m * p(i) of them will be from 
r(i).) Let us also assume that p(0) + p(1) +- + pin — 1) = 1, so that there is no 
possibility that an argument key is missing from the table. Then the average number of 
comparisons in searching for a record is 


È PO +2 * pl) +3 * p(2) +--+ +n* pin- 1) 
Clearly, this number is minimized if 


PO >= pl) >= p(2) >= --- >= p(n- 1) 
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(Why?) Thus, given a large stable file. reordering the file in order of decreasing prob- 
ability of retrieval achieves a greater degree of efficiency each time that the file is 
searched. 

If many insertions and deletions are to be Pertormed on a table, a list structure 
is preferable to an array. However. even im a list it would be better t9 maintain the 
relationship 

PHO) ce pe] ps >= n= 1) 


to provide for efficient sequential searching. This can be done most easily if a new item 
is inserted into the list at its proper place. E srob is the probability that a record with 
a given key is the search argument, that record should be inserted between records rit} 
and r(i + 1) where is such that 


PU) >= proh `>= mii 1) 


Of course, this method implies that an esira field P is kept with each record or 
that p can be computed based on some other Information in cach record, 


* Reordering a List for Maximum Search Efficiency 


Unfortunately. the probabilities PO are rarely known in advance, Although it is 
usual for certain records te be retrieved more often than others. it is almost impossible 
to identify those records is advance. Also. dre provability that a given record wili be 
retrieved may change over time, To use the ampie of the coliege registrar given ear- 
lier, a student begins as a freshman thigh probability of retrieval) and then becomes a 
sophomore and a junior dow prebubility) before becoming a senior (high probability}. 
Thus it would be helpful to have an algorithm that continually reorders the table so 
that more frequently accessed records drift to the front. while less frequently accessed 
records drift to the back. 

There are two search methods thit accomplish this, One of these is known as the 
move-to-front method and is efficient only for a table organized as a list. In this method, 
whenever a search is successful (that is, When the argument is found to match the key 
of a given record), the retrieved record is removed from its current location in the list 
and is placed at the head of the list. 

The other method is the transposition method, in which a successfully retrieved 
record is interchanged with the record that immediately precedes it. We present an 
algorithm to implement the transposition method on a table stored as a linked list. The 
algorithm returns a pointer to the retrieved record, or the nul! pointer if the record is not 
found. As before, key is the search argument. k and rare the tables of keys and records. 
table is a pointer to the first node of the list. : 


q= s= nuil; /* q is one step behind Pi, 7 
/* s is two steps behind p */ 
for (p = table; p t= null & k(p) != key: p = next(p)) í 
S=Q 
q= p; i 
} /* end for */ 
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if (p == null) 
return (p); 
/* We have found the record at position p. */ 
/* Transpose the records pointed to by p and q. */ 
if (q == null) 
/* The key is in the first table position. */ 
/* No transposition is necessary. tf 
return (p); ; 
/* Transpose node(q) and node(p). */ 
next(q) = next{p); 
next(p) = q; 
(s = null) ? table = p : next(s) = p; 
return (p); 


Note that the two if statements in the foregoing can be combined into the single 
statement if (p == null |q == null) return (py. for conciseness. We leave the im- 
plementation of the transposition method for an array and the move-to-front method as 
exercices for the reader. 

Both of these methods are based on the observed phenomenon that a record that 
has been retrieved is likely to be retrieved again. By advancing such records toward 
the front of the table, subsequent retrievals are more efficient. The rationale behind 
the move-to-front method is that, since the record is likely to be retrieved again. n 
should be placed at the position within the table at which such retrieval is most efficient. 
However, the counterargument for the transposition method is that a single retrieval 
does not yet imply that the record will be retrieved frequently; placing it at the tient of 
the table reduces search efficiency for all the e:lier records that formerly preceded it. 
By advancing a record only one position each time that it is retrieved, we ensure that ii 
advances to the front of the list only if it is retrieved frequently. 

It has been shown that, over a large number of search requests with an unchanging 
probability distribution, the transposition method is more efficient. However. the move- 
to-front method yields better results for a small to medium number of requests and 
responds more quickly to a change in probability distribution. It also has better worst- 
case behavior than does transposition. For this reason, move-to-front is preferred in 
most practical situations involving sequential search. 

If large numbers of searches with an unchanging probability distribution are 
quired, a mixed strategy may be best: use move-to-front for the first s searches to or- 
ganize rapidly the list in good sequence and then switch to transposition to obtain even 
better behavior. The exact value of s to optimize overall efficiency depends on the lengih 
of the list and the exact access probability distribution. 

One advantage of the transposition method over the move-to-front method is that 
it can be applied efficiently to tables stored in array form as well as to list-structured 
tables. Transposing two elements in an array is a rather efficient operation, whereas 
moving an element from the middle of an array.to its front involves (on the average) 
moving half the array. (However. in this case the average number of moves is not se 
large, since the element to be moved most often comes from the upper portion of the 
array.: 
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Searching an Ordered Table 


If the table is stored in ascending or descending order of the record keys, there 
are several techniques that can be used to improve the efficiency of searching. This 
is especially trie if the table is of fixed size. One obvious advantage in searching a 
sorted file over searching an unsorted file is in the case that the argument key is ab- 
sent from the file. In the case of an unsorted file, n comparisons are needed to detect 
this fact. In the case of a sorted file, assuming that the argument keys are uniformly 
distributed over the range of keys in the file, only n/2 comparisons (on the average) 
are needed. This is because we know that a given key is missing from a file sorted 
in ascending order of keys as soon as we encounter a key that is greater than the 
argumeat. 

Suppose that it is possible to collect a large number of retrieval requests before 
any of them are processed. For example, in many applications a response to a request 
` for information may be deferred to the next day. In such a case, all requests in a spe- 
cific day may be collected and the actual searching may be done overnight, when no 
new requests are coming in. If both the table and the list of requests are sorted, the 
sequential search can proceed through both concurrently. Thus it is not necessary to 
search through the entire table for each retrieval request. In fact, if there are many such 
requests uniformly distributed over the entire table, each request will require only a 
few lookups (if the number of requests is less than the number of table entries) 
or perhaps only a single comparison (if the number of requests is greater than the 
number of table entries). In such situations sequential searching is probably the best 
method to use. 

Because of the simplicity and efficiency of sequential processing on sorted files, 
it may be worthwhile to sort a file before searching for keys in it. This is especially 
true in the situation described in the preceding paragraph, where we are dealing with a 
“master” file and a large “transaction” file of requests for searches. 


Indexed Sequential Search 


There is another technique to improve search efficiency for a sorted file, but it 
involves an increase in the amount of space required. This ‘method is called the in- 
dexed sequential search method. An auxiliary table, called an index, is set aside in 
addition to the sorted file itself. Each element in the index consists of a key kindex 
and a pointer to the record in the file that corresponds to kindex. The elements in the 
index, as well as the elements in the file, must be sorted on the key. If the index is one 
eighth the size of the file, every eighth record of the file is represented in the index. 
This is illustrated by Figure 7.1.1. 

The algorithm used for searching an indexed sequential file is straightforward. 
Let r, k, and key be defined as before, let kindex be an array of the keys in the index, 
and let pindex be the array of pointers within the index to the actual records in the file. 
. We assume that the file is stored as an array, that n is the size of the file, and that indxsze 
is the size of the index. 
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(Key) (Record) 


smn 


Figure 7.1.1 indexed sequential file. 


for (i = 0; i < indxsize & kindex(i) <= key; i++) 


lowlim = (i == 0): ? 0: pindex(i - 1); 
hilim = (i == indxsize) ? n- 1: pindex(i) - 1; 
for (j = lowlim; j <= hilim & k(j) != key; j++) 


return (C > hilim) ? -1: j); 


Note that in the case of multiple records with the same key, the foregoing algorithm 
does not necessarily return a pointer to the first such record in the table. 

The real advantage of the indexed sequential method is that the items in the table 
can be examined sequentially if all the records in the file must be accessed, vet the 
search time for a particular item is sharply reduced, A sequential search is performed 
on the smaller index rather than on the larger table. Once the correct index position has 


been found a second sequential search is performed on a small portion of the record 
table itself. 
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The use of an index is applicable to a sorted table stored as a linked list, as well 
as 10 one stored as an array. Use of a linked list implies a larger space overhead fur 
pointers, although insertions and deletions can be performed much more readily. 

If the table is so large that even the use of an index does not achieve sufficient 
efliciency (either because the index is large in order to reduce sequential searching in 
the table. or because the index is small se that adjacent keys in the index are far from 
each other in the table). a secondary index can be used. The secondary index acts as 
an index to the primary index. which points to entries in the sequential table. This is 
illustrated in Figure 7.1.2. 

Deletions trom an indexed sequential table can be made most easily by flagging 
deleted entries. In sequential searching through the table, deleted entries are ignored. 
Note that if an element is deleted, even if its key is in the index. nothing need be done 
to the index: only the original table entry is flagged. 

Insertion into an indexed sequential table isgmore difficult, since there may not 
be room between two already existing table entries, thus necessitating a shift in a large 
number of table clernents. However. if a nearby item has been flagged as deleted in the 
table. caly a few items need to be shifted and the deleted item can be overwritten. This 
mayan turn require iteration of the index if an item pointed to by an index element is 
shifted. An alternative method is to keep an overflow area at some other location and link 
together any inserted records. However, this would require an extra pointer field in cach 
record of the original table. You are asked to explore these possibilities as an exercise. 


Binary Search 


The most eflicient method of searching a sequential table without the use of aux- 
iliary indices or tables is the binary search. You should be familiar with this search 
technique from Sections 3.1 and 3.2. Basically. the argument is compared with the key 
of the middle element of the table. If they are equal. the search ends successfully: oth- 
erwise., either the upper or lower half of the table must be searched in a similar manner. 

In Chapter 3 it was noted that the binary search can best be defined recursively. As 
a result. a recursive detinition. a recursive algorithm, and a recursive program were pre- 
sented for the binary search. However, the overhead associated with recursion may make 
it inappropriate for use in practical situations in which efficiency is a prime consideration. 
We therefore present the following nonrecursive version of the binary search algorithm: 


Tow = 0; 
Aten- i: 
while (low <= hi) { 
mid = (low + hi)/2; 
if (key == k(micj) 
return(mid); 
if (hey < k(mid)) 
t= mid = 1; 
else 
1s mids 1; 
} Z ena while */ 
return(-2); 
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Each comparison in the binary search reduces the number of possible candidates 
by a factor of 2. Thus, the maximum number of key comparisons is approximately 
log2 n. (Actually, it is 2 * logs n since, in C, two key comparisons are made each time 
through the loop: key == (mid) and key <, k(mid). However, in assembly language 
or in FORTRAN using an arithmetic IF statement, only one comparison is made. An 
optimizing compiler should be able to eliminate the extra comparison.) Thus, we may 
say that the binary search algorithm is O(log n). 

Note that the binary search may be used in conjunction with the indexed sequen- 
tial table organization mentioned earlier. Instead of searching the index sequentially, a 
binary search can be used, The binary search can also be used in searching the main . 
table once two boundary records are identified. However, the size of this table segment 
is likely to be small enough so that a binary search is not more advantageous than a 
sequential search. i 

Unfortunately, the binary search algorithm can only be used if the table is stored 

as an array. This is because it makes use of the fact that the indices of array elements 
are consecutive integers. For this reason the binary search is practically useless in sit- 
uations where there are many insertions or deletions, so that an array structure is inap- 
propriate. : 
One method for utilizing binary search in the presence of insertions and deletions 
it the maximum number of elements is known involves a data structure known as the 
padded list. The method uses two arrays: an elemen: array and a parallel flag array. The 
element array contains the sorted keys in the table with “empty” slots initially evenly 
interspersed among the keys of the table to allow for growth. An empty slot is indicated 
by a O value in the corresponding flag array element, whereas a full slot is indicated 
by the value 1. Each empty slot in the element array contains a key value greater than 
or equal to the key value in the previous full slot and less than the key value in the 
following full slot. Thus the entire element array is sorted, and a valid binary search 
can be performed an it. 

To search for an element, perform a binary search on the element array. If the 
argument key is not found, the element does not exist in the table. If it is found and 
the corresponding flag value is 1, the element has been located. If the correspond- 
ing flag value is 0, check if the previous full slot contains the argument key. If it 
does, the element has been located: if it does not, the element does not exist in the 
table. 

To insert an element, first locate its position. If the position is empty, insert the 
element in the empty position, resetting its flag value to 1, and adjust the contents of all 
previous contiguous empty positions to equal the contents of the previous full element 
and of all following contiguous empty positions to the inserted element, leaving their 
flags at 0. If the position is full, shift forward by one position all the following elements 
up to the first empty position (overwriting the first empty position and resetting its flag 
to 1) to make room for the new element. Deletion simply involves locating a key and 
changing its associated flag value to 0. Of course, the drawbacks of this method are the 
shifting that must be done at insertion and the limited room for growth. Periodically, it 
may be desirable to redistribute the empty spaces evenly through the array to improve 
insertion speed. 
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interpolation Search 


. Another technique for searching an ordered array is called interpolation search. 
If the keys are uniformly distributed between k(0) and k(n — 1), the method may be 
even more efficient than binary search. 

Initially, as in binary search, low is set to 0 and high is set to n — 1, and throughout 
the algorithm, the argument key key is known to be between k(/ow) and k(high). On the 
assumption that the keys are uniformly distributed between these two values, key would 
be expected to be at approximately position i 


mid = low + (high — low) * ((key — k(low))/(k(high) — k(/ow))), 


If key is lower than k(mid ), reset high to mid — 1; if higher, reset low to mid + 1. Repeat 
the process until the key has-been found or low > high. 

Indeed, if the keys are uniformly distributed through the array, interpolation 
search requires an average of log2 (log. n) comparisons and rarely requires much 
more, compared with binary search's log) n (again. considering the two comparisons 
for equality and inequality of key and k(mid) as one): However, if the keys are not 
uniformly distributed. interpolation search can have very poor average behavior. In the 
worst case, the value of mid can consistently equal low + 1 or high — 1, in which case 
interpolation search degenerates into sequential search. By contrast, binary search's 
comparisons are never greater than approximately log. n. In practical situations. keys 
often tend to cluster uround certain values and are not uniformly distributed. For exam- 
ple, more names begin with “S” than with “Q,” and there are likely to be many Smiths 
and very few Quodnots. In such situations, binary search is far superior to interpolation 
search. 

A variation of interpolation search, called robust interpolation search (or fast 
search), attempts to remedy the poor practical behavior of interpolation search while 
extending its advantage over binary search to nonuniform key distributions. This is 
done by establishing a value gap so that mid-low and high-mid are always greater than 
gap. Initially, gap is set to sgrt(high — low + 1). probe is set to low + (high — low) * 
((key — k (low) \(k(high) — k(low))), and mid is set equal to min(high — gap. 
max(probe, low + gap)) (where min and ax return the minimum and maximum, 
respectively, of two values). That is, we guarantee that the next position used for com- 
Parison (mid) is at least gap positions from the ends of the interval, where gap is at 
least the square root of the interval. When the argument key is found to be restricted 
to the smaller of the two intervals, from low to mid and mid to high. gap is reset to the 
square root of the new interval size. However, if the argument key is found to lie in 
the larger of the two intervals, the value of gap is doubled, although it is never allowed 
to be greater than half the interval size. This guarantees escape from a large cluster of 
similar key values. 

The expected number of comparisons for robust interpolation search for a ran- 
dom distribution of keys is O(log log n). This is superior to binary search. On a list 
of approximately 40,000 names, binary search requires an average of approximately 
16 key comparisons. Owing to clustering of names in practical situations. interpola- 
tion search required 134 average comparisons in an actual experiment, whereas robust 
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interpolation search required only 12.5. Ona uniformly distributed list of approximately 
46,060 elements—log> (logs 40.000) is approximately 3.9-——robus interpolation search 
required 6.7 average comparisons. (It should be noted that the extra computation time 
required for robust interpolation search may be substantial but is tgnored in these find- 
ings ) The worst case for robust interpolation search is (O( log n°) comparisons. which 
is higher than that for binary search. but much better than the O(n) of regular interpo- 
lation search. 

However. on most computers. the computations required by interpolation search 
are very slow, since they involve arithmetic on keys and complex multiplications and 
divisions. Binary search requires only arithmetic on Mleger indexes and divisor by 2 
which can be performed efficiently by shifting one bit to the night. Thus the computa- 
tional requirements of interpolation search often cause it to perform more slowly than 
binary search even when it requires fewer comparisons. 


EXERCISES 


TAL. Modify the scareh and Insertion algorithms of this section se that they become update 
algorithms, [fan aigerithm finds an i such that kev equals ks, change the value u rij: 
tO TeC. 

LZ haplement the sequential search and the sequential search and insertion alesrithms in 
© for doth arrays and linked ists. 

ALI Compare the ehhoienes or searching anterdered sequential tabic of size n and searching 

an unordered lable af the same size tor the hes kev: 

fab if no record with key kev is present 

th) Hone record with hey Ack is present and only one is sought 

ce) TP mere than one record with hey Aes is present anu it is desired te tind only the 

first one 

tdi If more than one record with key kev is present and it is desired u 


TL4. Assume that an ordered table is stored as a circular jist with two external perae 
and ether. table always points to the node containing the record with the sminitest Key 
other is minalh equal io table but is reset each time a search is performed t point to 
the record that is retrieved. Ifa search is unsuccessful, other is reset to rable. Wine a C 
routine searchitabic, other, kev) that implements this method and that returns i ponner 
toa retrieved record or a null pointer if the search is unsuccessful. Explain hew keeping 
the pointer other can reduce the average number of comparisons in a search, 

7AL5. Consideran ordered table implemented as an array or as a doubly linked list se that the 
table can be searched sequentially either backward or forward. Assume thet u single 


pore: p points to dhe jast record successtully retrieved. The search always begins atthe 
record pointed to by p but May Proceed m either direction. Write a routne sears lttable, 
P. Key) for the case of an array and a doubly linked iist to retrieve a record with kes 
her and to medii, p accordingh. Prove that the numbers of key comparisons m hoth 
the successtul and unsuccesstul cases are the same as in the method of Exercise TLs. 
in which the table may be scanned in only one direction but the scanning process may 
Stari ai ope of two points, 


TAG. Considera Programmer who writé the following code: 
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7.1.9. 


7.1.0. 


l 


if (cy 3 
if (ce ) 
if (¢3 ) 
i if (Cy ) 
{ statement } 


where c, is a condition that is either irue or false. Note that rearranging the conditions in 
a dificrent order results in an equivalent program, since the {siatement! isonly executed 
n althe c, are true. Assume that rimet is the ume needed to evaluate Condition c, and 
that prohi iiis the probability tfit condition ets true. In what order should the conditions 
be arranged to make the program most efficient? 

Modity the indexed sequential search se that in the case of multiple records with the 
same hey tt returns the first such record in the table. 


Consider the tollow int © implementation of an indexed sequential file: 


ct tabretwme 7 
int v À 
int r: 
int Mag; 
5 


Struct isfiletspe f 
Struct indxtype indy fINDXSI7E); 
Struct tanletyoe tablefTABLESTZ£}; 
E 


Struct isfiletspe isfile: 


Write a C routine crcatetistide) that imtiatizes such a file trom mput data. Each input 
lins contains a key and a record. The input is seried in ascending key order. Each mdex 
entry corresponds te ten tahle entries. ay ts set te TRUE in an occupied table entry and 
19 FALSE in an unoccupied entry. Two out of every ten table entries are lett unoccupied 
to allow for fuiure growth, 


Given an indexed segue tije as m the previous exercise. write a C roume 
searchiistile. sews to print the record m the tile with key kev if it is present and an 
indication that the record is missing if no record with that key exists (How cu you 
ensure that an unsuccessful sc ts as efficient as possible’: Abe. write routines 
insertistile, key. reo) iv insert a record reo with key kev and delete: fate te iin delete 
the record with key kev, 


Consider the following version of the binary search. whieh assume chat the keys are 
comtamed in ACH) throueh ten and that the special element AcQi as smaller han o ery 
possible hey: 
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mid = n/2; 
len = (n - 1)/2; 
while (key != k(mid)) { 
if (key < k(mid)) 
mid -= len/2; 
else 
mid += len/2; 
if (Ten == 0) 
return(-1); 
len /= 2: 
} /* end while */ » 
return(mid); 


Prove that this algorithm is correct. What are the advantages and/or disadv antages of 
this method over the method presented in the text? 


7.1.11. The following search algorithm on a sorted array is known as the Fibonaccian search 


because of its use of Fibonacci numbers. (For a definition of Fibonacci numbers and the 
fib function, see Section 3.1.) 


for (j = 1; fib(j) < mj j++) 


mid =n - fib(j - 2) +1; 
fl = fib(j - 2); 
f2 = fib(j - 3); 
while (key != k(mid)) 
if (mid < 0 key > k(mid)) { 
if (fl == 1) 
return(-1); 
mid += f2; 
fl -= f2; 
f2 -= fl; 
} 
else { 
if (N == 0) 
return(-1); 
mid -= f?; 
tafl- f?; 
Fl = f2; 
Net; 
} /* end if */ 
return(mid); 


Explain how this algorithm works. Compare the number of key comparisons with the 
number used by the binary search. Modify the initial portion of this algorithm so that it 
computes the Fibonacci numbers ney: rather than jooking them up in a table or 
computing each anew. 

7.1.12. Modify the binary search of the text so that in the case of an unsuccessful search it 
returns the index i such that k(i) < key < k(i + 1). If kev < k(0). it returns —1, and if 


key > k(n — 1), it returns n — 1. Do the same for the searches of Exercises 7.1.10 and 
Pith : 
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7.2 TREE SEARCHING 


In Section 7.1 we discussed search operations on a file that is organized either as an 
array or as a list. In this section we consider several ways of organizing files as trees 
and some associated searching algorithms. 

In Sections 5.1 and 6.3 we presented a method of using a binary tree to store a file 
in order to make sorting the file more efficient. In that method, all the left descendants- 

-of a node with kev key have keys that are less than key, and all the right descendants 
have keys that are gre. er than or equal to key. The inorder traversal of such a binary 
tree yields the file in asccinding key order. 

Such a tree may also be used as a binary search tree. Using binary tree notation, 
the algorithm for searching for the key key in such a tree is as follows (we assume that 
each node contains four fields: k, which holds the record’s key value, r, which holds the 
record itself, and left and sngt. which are pointers to the subtrees): ; 


p= tree; 

eit (p != nul] && key != k(p}) f 
= (key < k(p)) ? left(p) : right(p); 

retu; 


The efficiency of the search process can be improved by using a sentinel, as in 
sequential searching. A sentinel nede, with a separate external pointer pointing to it, 
remains allocated with the tree. All left or right tree pointers that do not point to another 
tree node now point to this sentinel node instead of equaling null. When a search is 
performed, the argument key is first inserted into the sentinel node, thus guaranteeing 
that it will be located in the tree. This enables the header of the search loop to be written 
while (key! = k(p)) without the risk of an infinite loop. After leaving the loop, if p 
equals the external sentinel pointer, the search is unsuccessful; otherwise p points to 
the desired node. We leave the actual algorithm to the reader. 

Note that the binary search of Section 7.1 actually uses a sorted array as an implicit 
binary search tree. The middle element of the array can be thought of as the root of the 
tree, the lower half of the array (all of whose élements are less than the middle element) 
can be considered the left subtree, and the upper half (all of whose elements are greater 
than the middle element) can be considered the right subtree. 

A sorted‘array can be produced from a binary search tree by traversing the tree 
in inorder and inserting each element sequentially into the array as it is visited. On the 
other hand, there are many binary search trees that correspond to a given sorted array. + 
Viewing the middle element of the array as the root of a tree and viewing the remaining . 
elements recursively as left and right subtrees’ produces a relatively balanced binary 
search tree (see Figure 7.2.1a). Viewing the first element of the array as the root of a 
tree and each successive element as the right son of its cay gates produces a very 
unbalanced binary tree (see Figure 7.2.1b). 

: The advantage of using a binary search tree Over an array is that a tree enables 
search, insertion, and deletion operations to be performed efficiently. If an array is used, 
an insertion or deletion requires that approximately half of the elements of the array be 
moved. (Why?) Insertion or deletion in a search tree, on the other hand, requires that 
only.a few pointers be adjusted. 
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Figure 7.2.1 Sorted array and two of its binary tree representations. 
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Figure 7.2.1 (cont) 
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Inserting into è Binary Search Troe 


“The following algorithm séarches a binary search tree and inserts’a new record 
-into the tree if the search is unsuccessful, (We assume the existence of a function make- 
‘tree that constructs a binary tree consisting of a single node whose information field is 
passed as an argument and returns a pointer to the tree. This function is described in 
` Section 5.1. However, in our particular version, we assume that maketree accepts two. 
arguments, a resor; and a key.) 


q= mitts 
p= tree; - 
while (p != null) { 
if (key == k(p)) 
return(p) ; 


G = p; 
if (key < k(p)) 
p = left(p); 
, else 
p = right(p); 
} /* end while */ 
v = maketree(rec, key); 
if (q == null) ` 
tree = v; 
else ; 
if (key < k(@)) 
left(q) = V; 
else ‘ . 
right(q) = v; 


rera) : 


Note that after a new record is inseftdd, the tree retains the property of being sorted in ` 
an inorder traversal. 


Dejatimg; from a Binery Search Tree 


We now present an sigotithm to delete a node with key int from a binary search 
tree. There are three cases to consider. If the node to be deleted has no sons, it may be 
deleted without further adjustment to the tree. This is illustrated in Figure 7.2.2a. If the 
node to be deleted has only one subtree, its only son can be moved up to take its place. - 
‘This is illustrated in Figure 7.2.2b. If, however, the node p to be deleted has two sub- 
trees, its inorder successor s (or predecessor) must take its place. The inorder successor 
cannot have a left subtree (since a left descendant would be the inorder successor of p). 
Thus the right son of s can be moved up to take the place of s. This is illustrated in Fig- 
ure 7.2.2c, where the node with key 12 replaces the node with key 11 and is replaced, 
Jin turn, by the node with key 13. 

In the following algorithm, if no node with key key exists in the tree, the tree is 
left urichanged. - è 
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si = tree; 


= null; 
j search for the AFH with the key key, set p to point */ 
Vi to the node and q to its father,- if any. */ 
while (p != null & k(p) != key) { i 
A pi 


: = (key < KCP) ? -Teft(p) : right(p); 
} ia end while */ 
if (p == null) ` i 
/* the key does not exist in the tree */ 
i fa. leave the tree unchanged */ 
return; 


- `. (b) Deleting node with key 5. 


Figure 7.2.2 , Deleting nodes from a binary search tree. 
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< (c) Deleting node with key ʻi. 


Figure 7.2.2 (cont.) 


A 


/* set the variable rp to the node that will replace */ 


vi node(p). *y 
/* first two cases: the node to be deleted has at most*/ 
/* one son */ 


if (left(p) == null) 
rp = right(p); 
else 
if (right(p) == null) 
rp = left(p); 
else { 
/* third case: node(p) has two sons. Set rp to the */ 
/* inorder successor of p and f to the father of rp */ 
f =p; j 
rp = right(p); 
s = left(rp); /* sis always the left son of rp */ 
while (s = null) { t 
f= rp; 
rp = S; 
s = left(rp); 
} > /* end while */ 
/* at this point, rp is the inorder successor of p */ 
if (f != p) { 
/* pis not the father of rp and rp == left(f) */ 
left(f) = right(rp); 
/* remove node(rp) from its current position and */ 
/* replace it with the right son of node(rp) */ 
J ‘node(rp) takes the place of node(p) */ 
right(rp) = r`ght(p); 
} /* end if */ 
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~/* set the left son of node(rp) so that */ 
/* node(rp) takes the place of node(p) */ 
left(rp) = left(p); 
} /* end if */ 
/* insert node(rp) into the position formerly */ 
p occupied by node(p) sf 
if (q == nul?) 
/* node(p) was the root of the tree */ 
tree = rp; 
else 
(p == Teft(q)) ? “Teft(q) = rp : right(q) = rp; 
freenode(p) ; 
return; 


Efficiency of Binary Search Tree Operations 

As we have already seen in Section 6.3 (see Figures 6.3.1 and 6.3.2), the time 
required to search a binary search tree varies between O(n) and O(log n), depending 
on the structure of the tree. If elements are inserted into the tree by the foregoing inser- 
tion algorithm, the structure of the tree depends on the order in which the records are 
inserted. If the records aré inserted in sorted (or reverse) order, the resulting tree con- 
tains all nuli left (or right) links, so -hat the tree search reduces to a sequential search. 
If, however, the records are inserted so that half the records inserted after any. given . 
record r with key k have keys smaller than k’and half have keys greater than k, a bal- ` 
anced tree is achieved in which approximately log n key comparisons are sufficient to 
retrieve an element. (Again, it should be noted that examining a node in our insertion 
algorithm requires two comparisons: one for equality and the other for less than. How- 
ever, in machine language and in some compilers, these can be combined into a single 
comparison.) 

If the records are presented in random order (that is, any permutation of the n 
elements is equally likely), balanced trees result more often than not, so that on the 
average, search time remains O(log n). To see this, let us define the internal path length, 
i, of a binary tree as the sum of the levels of all the nodes in the tree (note that the level 
of a node equals the length of the path from the root to the node). In the initial tree of 
Figure 7.2.2, for example, i equals 30 (1 node at level 0, 2 at level 1, 4 at level 2, 4 at 
level 3, and 2 at level 4: F*0+2*1+4*2+4*%3+42%4 = 30). Since the number 
of comparisons required to access a node in a binary ‘search tree is one greater than 
the node’s level, the average number of comparisons required for a successful search 
in a binary search tree with z nodes equals (i + n)/n, assuming equal likelihood for 
accessing every node in the tree. Thus, for the initial tree of Figure 7.2.2, (30 + 13)/13, 
or approximately 3.31, comparisons are required for a successful search. Let s, equal 
the average number of comparisons réquired for a successful search in a random binary 
search tree of n nodes in which the search argument is equally likely to be any of the n 
keys, and let i, be the uverage internal path length of a random binary search tree of n 
nodes. Then s,, equals (i, + n)/n. 

Let u, be the average number of comparisons required for an unsuccessful search 
of a random binary search tree of n nodes. There are n + l+possible ways for an un- 
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successful search for a key key to-occur: key is less that k(1), key is.between k(1) and 
k(2), ... key is between k(n = 1) and k(n), and key is greater than k(n). These correspond 
to the n + 1 null subtree pointers in any n-node binary search. (It can be shown that any . 
binary search tree with n nodes has n + 1 null pointers.) 

Consider the extension of a binary tree formed by replacing each null left or 
right pointer with a pointer to a separate, new leaf node, called an external node. The 
extension of a binary tree of n nodes has n + 1 external nodes, each corresponding to 
one of the n + | key ranges for an unsuccessful search. For example, the initial tree of 
Figure 7.2.2 contains 13 nodes. Its extension would add two external nodes as sons of 
each of the leaves containing 1,7, 10, 13, and 15 and one external node as ari additional 
son of each of the nodes containing 5, 6, 9, and 12, for a total of 14 external nodes. Define 
the external path length, e, of a binary tree as the sum of the levels of all the external 
nodes of its extension. The extension of the initial tree of Figure 7.2.2 has 4 external 
nodes at level 3, 6 at level 4, and 4 at level 5, for an external path length of 56. Note 
that the level of an external node equals the number of comparisons in an unsuccessful ` 
search for a key in the range represented by that external node. Then if e, is the average 
external path length of a random binary search tree of n nodes, un = €„/(n + 1). (This 
‘assumes that each of the n + | key ranges is equally likely in an unsuccessful search. 
In Figure 7.2.2 the average number of comparisons for an unsuccessful search is 56/14, 
or 4.0.) However, it can be shown that e = i + 2n for any binary tree of n nodes (for 
example, in Figure 7.2.2, 56 = 30+2*13),sothate, = i,+2n. Since s, = (Un +n)/n 
and un. = en/(n + 1), this means that s, = ((n + 1)/n)un - 1. 

The number of comparisons required to access a key is one more than the number 
required when the node was inserted. But the number required to insert a key equals 
p number required in an unsuccessful search for that key before it was inserted. Thus | 

= 1+ (uotu t: tun- 1)/n. (That is, the average number of comparisons in 
aN an item in an n-node tree equals the average number in accessing each of 
the first item through the nth, and the number for accessing the ith equals one more 
than the number for inserting the ith or 1 + uj;-.) Combining this with the equation 

= ((n + 1)/n)un — 1 yields , 


si, itin ai oro hs a ey + + Un} 
for any n. Replacing n by n — 1 yields 
Nún- = 2(n- 1) + uo + tos + Un- 

and subtracting fom the previous equation yields 

(n+ 1)uy — nün-i = 2 + üni 
or 
Un = Up +2/(n + 1) 
i Since uy = 1, we have that r , 

EE + 2/3 PEIE AE E 
„and, therefore, since sn = ((n + 1)/m)uy — 1, that 


Sn = A(n + 1)/n) + 1/24 1/3 +e +1/n)-3 
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As n grows large, (n + 1) n is approximately 1, and it can be shown that 
1+ 1/2 +--+: + 1/n is approximately log (n), where Jog (n) is defined as the nat- 
ural logarithm of 7 in the standard C library file math.h. Thus Sn may be approximated 
(for large n) by 2 * log (n), which equals 1.386 * log) n. This means that average search. 
time in a random binary search tree is O(log n) and requires approximately ‘only 39 
percent more comparisons, on the average, than in a balanced binary tree. `. 
` As already noted, insertion in a binary search tree requires the same number.of 
comparisons as does an unsuccessful search for the key. Deletion requires the same 
‘ number of comparisons as does a search for the key to be deleted, although it does 
involve additional work in finding the inorder successor or predecessor. It can be shown 
that the deletion algorithm that we have presented actually improves the subsequent 
average Search cost of the tree. That is, a random n-key tree created by inserting n+ | 
keys and then deleting a random key has lower internal path length (and therefore lower 
average search cost) than does a random n-key tree created by inserting n keys. The 
process of deleting a one-son node by replacing it with its son, regardless of whether 
that son is a right or a left son, yields a better-than-average tree; a similar deletion 
algorithm which only replaces a one-son node by its son if it is a left son (that is, if 
its successor is not contained in its subtree) and-otherwise replaces the node with its 
inorder successor does produce a random tree, assuming no additional insertions. The 
. latter algorithm is called the asymmetric deletion algorithm. ; 


Strangely enough, however, as additional insertions and deletions are made using 
the asymmetric deletion algorithm, the internal path length and search time initially 
decrease but then begin to rise rapidly again. For trees containing more- than 128 keys, 
` the internal path length eventually becomes worse than that for a random tree, and for 

trees with more than 2048 keys, the internal path length eventually becomes more than 
50 percent worse than that of a random tree. R , 

An alternative symmetric deletion algorithm, which alternates between deleting 
the inorder predecessor and’successor on alternate deletions (but still replaces a one-son 
node with its son only when the predecessor or successor, respectively, is not contained 
in its subtree), does eventually produce better than random trees after additional mixed 
insertions and deletions. Empirical data indicate that path length is reduced after many 
alternating insertions and symmetric deletions to approximately 88 percent of its cor- 
responding random value. : E 2 


: Efficiency of Nonuniform Binary Search Trees . 


_ All of the preceding assumes that it is equally likely for the search argument to 
equal any key in the table. However, in actual practice it is usually the case that some 
records are retrieved very often, some-moderately often, and some are almost never 
retrieved. Suppose that records are inserted into the tree so that a more commonly ac- 
cessed record precedes one tiat is not so frequently accessed. Then the most frequently 
retrieved records will be nearer the root of the tree, so that the average successful search 
time is reduced. (Of course, this assumes that reordering the keys in order of reduced 
frequency of access does not seriously unbalance the binary tree, since if it did the re- 
duced number of comparisons for the most frequently accessed records might be offset 
by the increased number of comparisons for the vast majority of records.) 
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If the elements to be retrieved form a constant set, with no insertions or deletions, 
it may pay to set up a binary search tree that makes subsequent searches: more efficient. 
For example, consider the binary search trees of Figure 7.2.3. Both the trees of Figure 
7.2.3a and b contain three elements k1, k2, and k3, where kl < k2 < k3, and are valid 
binary search trees for that set. However, a retrieval of k3 requires two comparisons in 
Figure 7.2.3a but requires only one comparison in Figure 7.2.3b. Of course, there are 
still other valid binary search trees for this set of keys. 

The number of key comparisons necessary to retrieve a record equals the level 
of that record in the binary search tree plus 1. Thus a retrieval of k2 requires one com- 
parison in the tree of Figure 7.2.3a but requires three comparisons in the tree of Fig- 
ure 7.2.3b. An unsuccessful search for an argument lying immediately between two 
keys a and b requires as many key comparisons as the maximum number of compar- 
isons required by successful searches for either a or b. (Why?) This is equal to 1 plus 
the maximum of the levels of a or b. For example, a search for a key lying between 
k2 and k3 requires two key comparisons in Figure 7.2.3a and three comparisons in 
Figure 7.2.3b, whereas a search for a key greater than k3 requires two comparisons in 
Figure 7.2.3a, but only one comparison in Figure 7.2.3b. 

Suppose that p1, p2, and p3 are the probabilities that the search argument equals 
kl, k2,-and k3, respectively. Suppose also that gO is the probability that the search 
argument is less than k1, q1 is the probability that it is between k1 and k2, q2 is the 
probability that it is between k2 and K3, and 43 is the probability that it is greater than 
k3. Then pl + p2 + p3 + q0 + q1 + q2 + q3 = 1. The expected number of compar- 


Gs 
T D 


(a) Expected number of comparisons: 
2pl + p2 +2p3 +.2g0 + 2q1-+ 2q2 + 243 


Figure 7.2.3 Two binary search trees. 
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isons ina search is the sum of the probabilities that the argument has a given value times 
the number of comparisons required to retrieve that value, where the sum is taken over 
all possible search argument values, For example, the expected number of comparisons 
in searching the tree of Figure 7.2.3a is ; 


2pl + p2 + 2p3 + 240 + 2g] + 2g2 + 243 
and the expected number of comparisons in searching the tree of Figure 7.2.3b is 
2pl + 3p2 + p3 + 240 + 3q1 + 3q2 + q3 


This expected number of comparisons can be used as a measure of how “good” a par- 
ticular binary search tree is for a given set of keys and a given set of probabilities. Thus, 
for the following probabilities on the left, the tree of Figure 7.2.3a is more efficient; for 
the probabilities listed on the right, the tree of Figure 7.2.3b is more efficient. 


pi=.1 pl=.1 
p2= .3 p2=.1 
p3 =.1 p3-= 3 

=.) =.1 
ql = .2 gi=.l 
q2=.1 q2=.1 
q3 =.l q3 =.2 


Expected number for 7.2.3a = 1.7 Expected number for 7.2.3a = 1.9 
Expected number for 7.2.3b = 2.4 Expected number for 7.2.3b = 1.8 


Optimum Search Trees 


A binary search tree that minimizes the expected number of comparisons for a 
given set of keys and probabilities is called optimum. The fastest-known algorithm to 
produce an optimum binary search tree is O(n) in the general case. This is too expen- 
sive unless the tree is maintained unchanged over a very large number of searches. In 
cases in which all the p(i) equal 0 (that is, the keys act only to define range values with 
which data are associated, so that all searches are “unsuccessful”), an O(n) algorithm 
to create an optimum binary search tree does exist. ; 

However, although an efficient algorithm to construct an optimum tree in the gen- 
eral case does not exist, there are several methods for constructing near-optimum trees 
in O(n) time. Assume n keys, k(1) through k(n). Let p(i) be the probability of search- 
ing for a key k(i), and q(i) the probability of an unsuccessful search between k(i — 1) 
and k(i) (with 4(0) the probability of an unsuccessful search for a key below k(1), and 
q(n) the probability of an unsuccessful search for a key above k(n)). Define s(ij) as 
gi) + pli + 1) +++» + ajy. 

One method, called the balancing method, attempts to find a value i that min- 
imizes the absolute value of s(0,i —; 1) — s(i,n) and establishes k(i) as the root of 
the binary search tree. vith k(1) through k(i — 1) in its left subtree and kli + i) 
through k(n) in its right subtree. The process is then applied recursively to build the 
left and right subtrees. j 


Sec. 7.2 Tree Searching 411 


Locating the value i i at which abs(s(0,i — 34) — s(in)) is minimized can be done ` 
- efficiently as follows. Initially, set up an array sO[n+1] such that òli] equals s(0,i). 
This can be done by initializing sO[0] to g(0) ci sol] to sU — 1] + pG) + g(j) forj J 
from | to n in turn. Once sO has been initialized, s(i,j) can be computed for any i and j 
as sO[j] — sO[i — 1] — pCi) whenever necessary. We define si). as s(0j — 1) — sQ, n). 

We wish to minimize abs(si(i)). ; 

After sO has been initialized, we begin the process of finding an i to minimize 
abs(s(0, i — 1) — s(i, n)), or si(i). Note that. si is‘a monotonically increasing function. 
Note also that si(0) = g(0) — 1, which is negative, and si(n) = 1 — q(n + 1), which is 

_ positive. Check the values of si(1), si(n), si(2), si(n — 1), si(4), si(n — 3),..., si(2/), 
si(n + 1 — 24) in turn until discovering the. first positive si(2/) or the first negative 
si(n + 1 — 21). Ifa positive si(2/) is found first, the desired i that minimizes abs(si(i)) . 
lies within the interval (2/~!, 24]; ifa negative si(n + 1 — 2/) is found first, the desired 
i lies within the interval [n + 1 — 2/,n + 1’ — 2/7"]. In either case, į has been narrowed 
down to an interval of size 2/~'. Within the interval, use a binary search to narrow down 
on i. The doubling effect in the interval size guarantees that the entire recursive process 
is O(n), whereas if a binary search were used on the entire interval [0, n] to start, ‘the 
process would be O(n iog n). 

A second method used to construct near-optimum binary search trees is called 
the greedy method. Instead of building the tree from the top down, as'in the balancing 

` method, the greedy method builds the tree from the botiom up. The method uses a 
doubly linked linear list in which each list element contains four pointers, one key 
value, and three probability values. The four pointers are left and right list point-. 
ers used to organize the doubly linked list and left and right subtree pointers. used 
to keep track of binary search, subtrees containing keys less than and greater than 
the key value in the node. The three probability values are the sum of the probabil- 
ities in the left subtree, called the left probability, the probability p(i) of the node’s ` 
key value k(i), called the key probability, and the sum of the probabilities i in the 
right subtree, called the right probability. The total probability of a node is defined 
as the sum of its left, key, and right probabilities. Initially, there are n nodes in the 
list. The key value in the ith node is K(i), its left probability is g(i — 1), its right 
probability ‘is qli), its key Pegpablity: 4 is p(i), and its left and right subtree pointers 
are null. 

Each iteration of the algorithm finds the first node nd on the list whose total prob- 
-ability is less than or equal to its successor’s (if no node qualifies, nd is set to the last 
node in the list). The key in nd becomes the root of a binary search subtree whose left ° 
and right subtrees are the left and right ‘subtrees of nd. nd is then'removed from the 
list. The left subtree pointer of its successor (if any) and the right subtree pointer of its 
predecessor (if any) are reset to point to the'new subtree, and the left probability of its 
successor and the right probability of its predecessor are reset to the total probability of 
nd. This process is repeated until only one node remains on the list. (Note that it is not 
necessary to begin a full Jist traversal from the list beginning on each iteration; it is only 
necessary to begin from the second predecessor of the node removed on the previous 
iteration.) When only. one node remains on the list, its key is placed in the root of the 
final binary search tree, with the left and right subtree pointers of the mode as the left 
and right subtree pointers of the root. 
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Another technique of reducing average search time when search probabilities are 
known is a split tree. Such a tree contains two keys rather than one in each node. The 
first, called the node key, is-tested for equality with the argument key. If they are equal, 
. the search ends successfully; if not, the argument key is compared with the second key 

in the node, called the split key, to determine if the search should continue in the left or 
right subtree. A particular type of split tree, called a median split tree, sets the node key 
in each node to the most frequent among the keys in the subtree rooted at that node and 
sets the split key to the median of the keys in that subtree (that is, the key k such that an 
equal number of keys in the subtree are less than and greater than k). This has the twin 
advantages of guaranteeing a balanced tree and assuring that frequent keys are found 
near the root. Although median split trees require an extra key in each node, they can 
be constructed as almost complete.binary. trees implemented in an array, saving space 


`. for tree pointers. A median split tree from a given set of keys and frequencies can be 


- built in time O(n log n), and a search in such a tree always requires fewer than log2 n 
node visits, although each visit does require two separate comparisons. 


Balanced Trees 


As noted in the foregoing, if the probability of searching for a key in a table is the 
same for all keys, a balanced binary tree yields the most efficient search. Unfortunately, 
the search and insertion algorithm presented in the foregoing does not ensure that the 
tree remains balanced; the degree of balance is dependent on the order in which keys are 
inserted into the tree. We would like to have an efficient search and insertion algorithm 
that maintainsthe search tree as a balanced binary tree. 

Let us first define more precisely the notion of a “balanced” tree. The height 
of a binary tree is the maximum level of its leaves (this is also sometimes known as 
the depth of the tree). For convenience, the height of a null tree is defined as —1. A 
balanced binary tree (sometimes called an AVL tree) is a binary tree in which the 
heights of the two subtrees of every node never differ by more than 1. The balance of 
a node in a binary tree is defined as the height of its left subtree minus the height of its 
right subtree. Figure 7.2.4 illustrates a balanced binary tree. Each node in a balanced 
binary tree has a balance of 1, —1, or 0, depending on whether the height of its. left 
subtree is greater than, less than, or equal to the height of its right subtree. The balance 
of each node is indicated in Figure 7.2.4a. 

Suppose that we are given a balanced binary tree and use the preceding search 
and insertion algorithm to insert a new node p into the tree. Then the resulting tree may 
or may not remain balanced. Figure 7.2.4b illustrates all possible insertions that may be 
made to the tree of Figure 7.2.4a. Each insertion that yields a balanced tree is indicated 

by a B. The unbalanced insertions are indicated by a U and are numbered from 1 to 12. . 

` Itis easy to see that the tree becomes unbalanced only if the newly inserted node is a left 
descendant of a node that previously had a balance of 1 (this occurs in cases U1 through 
U8 in Figure 7.2.4) or if it is a right descendant of.a node that previously had a balance 
“of —1 (cases U9 through U12). In Figure 7.2.4b, the youngest ancestor that becomes 
unbalanced in each insertion is indicated by the numbers contained in three of the rides. 
Let us examine further the subtree rooted at the youngest ancestor to become 
‘unbalanced as a result of an insertion. We illustrate the case where the balance of this 
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Figure 7.2.4 Balanced binary tree and possible additions. 


subtree was previously 1, leaving the other case to the reader. Figure 7.2.5 illustrates 
this case. Let us call the unbalanced node A. Since A had a balance of 1, its left subtree 
was nonnull; we may therefore designate its left son as B: Since A is the youngest 
ancestor of the new node to become unbalanced, node B must have had a balance of 0. 
(You are asked to prove this fact as an exercise.) Thus, node B must have had (before 
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Figure 7.2.5 Initial insertion; all bal- 
(b) ances are prior to insertion, 
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the insertion) left and right subtrees of equal height n (where possibly n = —1). Since 
` _ the balance of A was 1, the right subtree of A must also have been of height n. 

There are now two cases to consider, illustrated by Figure 7.2.5a and b. In Figure 
7.2.5a the newly created node is inserted into the left subtree of B, changing the balance ` 
of B to 1 and the balance of A to 2. In Figure 7.2.5b the newly created node is inserted 
into the right subtree of B, changing the balance of B to — 1 and the balance of A to 2. To 
maintain a balanced tree it is necessary to perform a transformation on the tree so that 


1. The inorder traversal of the transformed tree is the same’as for the original tree 
(that is, the transformed tree remains a binary search tree). 
2. The transformed tree is balanced. 


` (a) Original tree. 


(c) Left rotation. 


Figure 7.2.6 Simple rotation on a tree. 
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Consider the trees of Figures 7.2.6a and b. The tree of Figure 7.2.6b is said to be 
a right rotation of the tree rooted at A of Figure 7.2.6a. Similarly, the tree of Figure 
7.2.6¢ is said to be a left rotation of the tree rooted at A of Figure 7.2.6a. 

An algorithm to implement a left rotation of a subtree rooted at p is as follows: 


q = right(p); 
hold = left(q); 
left(q) = p; 
right(p) = hold; 


Let us call this operation /eftrotation(p). rightrotation(p) may be defined simi- 
larly. Of course, in any rotation the value of the pointer to the root of the subtree being 
rotated must be changed to point to the new root. (In the case of the foregoing left ro- 
tation, this new root is g.) Note that the order of the nodes in an inorder traversal is 
preserved under both right and left rotations. It therefore follows that any number of 
rotations (left or right) can be performed on an unba'onced tree to obtain a balanced 
tree, without disturbing the order of the nodes in an inoruer traversal. 

Let us now return to the trees of Figure 7.2.5. Suppose that a right rotation is per- 
formed on the subtree rooted at a in Figure 7,2.5a. The resulting tree is shown in Figure 
7.2.7a. Note that the tree of Figure 7.2.7a yields the same inorder traversal as that of 
Figure 7.2.5a and is also balanced. Also, since the height of the subtree of Figure 7.2.5a 
was n + 2 before the insertion and the height of the subtree of Figure 7.2.7a is n +2 with 
the inserted node, the balance of each ancestor of node A remains undisturbed. Thus re- 
placing the subtree of Figure 7.2.5a with its right rotation of Figuré 7.2.7a guarantees 
that a balanced binary search tree is maintained. 

Let us now turn to the tree of Figure 7.2.5b, where the newly created node is 
inserted into the right subtree of B. Let C be the right son of B. (There are three cases: 
C may be the newly inserted node, in which case n = —1, or the newly inserted node 
may be in the left or right subtree of C. Figure 7.2.5b illustrates the case where it is in the 
left subtree; the analysis of the other cases is analogous.) Suppose that a left rotation on 
the subtree rooted at B is followed by a right rotation on the subtree rooted at A. Figure 
7.2.7b illustrates the resulting tree. Verify that the inorder traversals of the two trees are 
the same and that the tree of Figure 7.2.7b is balanced. The height of the tree in Figure 
7.2.7 is n + 2, which is the same as the height of the tree in Figure 7.2.5b before the 
insertion, so that the balance in all ancestors of A is unchanged. Therefore by replacing 
the tree of Figure 7.2.5b with that of Figure 7.2.7b wherever it occurs after insertion, a 
balanced search tree is maintained. : 

Let us now present an algorithm to search and insert into a nonempty balanced 
binary tree. Each node of the tree contains five fields: k and r, which hold the key 
and record respectively, left and right, which are pointers to the left and right subtrees 
respectively, and bal, whose vaiue is 1, —1, or 0, depending on the node’s balance. 
In the first part of the algorithm, if the desired key is not found in the tree already. a 
new node is inserted into the binary search tree without regard to balance. This first 
phase also keeps track of the youngest ancestor, ya, that may become unbalanced upon 
insertion. The algorithm makes use of the function makerree described previously and 
routines rightrotation and leftrotation, which accept a pointer to the root of a subtree 
and perform the desired rotation. > 
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Figure 7.2.7 After rebalancing, all 


(b), balances are after insertion. 
/* PART I: search and insert into the binary treg */ 
fp = null; 
p = tree; 
fya = null; 
ya = p; 


/* ya points to the youngest “ancestor which may become */ 
/* unbalanced. fya points to the father of ya, and fp */ 
/* points to the father of p. ef 
while (p != null) { 
if (key == k(p)) 
return(p); 
9 = (key < kK(p)) ? Jeft(p) : right(p); 
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if (q != nul) 
if (bal(q) != 0) { 
fya = p; 
ya = q; 
} /* end if */ 
fp = p; 
p=q; 
} /* end while */ 
* insert new record */ 
q = maketree(rec, key); 
bal(q) = 0; 
(key < k(fp)) ? left(fp) = q : right(fp) = q; 
/*. the balance on all nodes between node( ya) and node(q) */ 


/* must be changed from 0 */ 
p = (key < k(yay) ? left(ya) : right(y ): 
S= Dp; 


while (p != q) { 
if (key < k(p)) { 
bal(p) = 1; 
p= left(p); 


else { 
bal(p) = -1; 
p = right(p); 

} /* end if */ 
} /* end while */ 
1: PART II: ascertain whether or not the tree is i’, 
/* unbalanced. If it is, q is the newly inserted node, */ 
/* ya is its youngest unbalanced ancestor, fya is the */ 
/* father of ya and s is the son of ya in the direction */ 
a of the imbalance kj 
imbal = (key < k(ya)) ? 1: -1; 
if (bal(ya) == 0) { ; 

/* another level has been added to the tree */ 

f* the tree remains balanced */ 

bal(ya) = imbal; 

return(q); 
} /* end if */ 
if (bal(ya) != imbal) { 

/* the added node has been placed in the */ 

/* opposite direction of the imbalance * 

/* the tree remains balanced s/ 

bal(ya) = 0; 

return(q); 

/* end if */ 
/* PART III: the additional node has unbalanced the tree xy 
/* rebalance ©. oy performing the required rotations and sj 
/* then adjust the balances of the nodes involved +y 
if (bal(s) == imbal) { 

/* ya and s have been unbalanced in the same di rection; */ 
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Mh see Figure 7.2.5a where ya = a and s = b */ 
pes; 
if (imbal == 1) 
rightrotation(ya); 
else 
leftrotation(ya); 
bal(ya) = 0; 
bal(s) = 0; 
} 
else { 
/* ya and s are unbalanced in opposite directions; */ 
Yo see Figure 7.2.5b ay 
if (imbal == 1) { 
p = right(s); 
leftrotation(s); 
left(ya) = p; 
rightrotation(ya); 


> } 


else { 
p = left(s); 
right(ya) = p; 
rightrotation(s); 
leftrotation(ya); 
} /* end if */ 
/* adjust bal field for involved nodes */ 
if (bal(p) == 0) { 
/* p was inserted node */ 
bal(ya) = 0; 
bal(s) = 0; 


else 
if (bal(p) == imbal) { 
/* see Figures 7.2.5b and 7.2.7b  +/ 
bal(ya) = -imbal; 
bal(s) = 0; 


else { 


/* ` see Figures 7.2.5b and 7.2.7b  +/ 
/* but the new node was inserted into t3 */ 
bal(ya) = 0; 
bal(s) = imbal; 
} /* end if */ 
bal(p) = 0; 
} /* end if */ 
/* adjust the pointer to the rotated subtree i 
if (fya == null) 
tree = p; 
else 
(ya == right(fya)) ? right(fya) = p: left(fya) = p; 
return(q); 
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The maximum height of a balanced binary search tree is 1.44 log) n, so that a 
search in such a tree never requires more than 44 percent more comparisons than that 
for a completely balanced tree. In actual practice. balanced binary search trees behave 
even better, yielding search times of log2 n + 0.25 for large n. On the average, a rotation 
is required in 46.5 percent of the insertions. 

The algorithm to delete a node from a balanced binary search tree while main- 
taining its balance is even Fhore complex. Whereas insertion requires at most a double 
Totation, deletion may require one (single or double) rotation at each level of the tree, 
or O(log n) rotations. However, in practice, an average of only 0.214 (single or double) 
rotations has been found to be required per deletion. 

The balanced binary search trees that we have looked at are called ‘height- 
balanced trees because their height is used as'the criterion for balancing. There are a 
number ofother ways of defining balanced trees. In one method. the weight of a tree 
is defined as the number of external nodessin the tree (this equals the number of null 
pointers in the tree). If the ratio of the weight of the left subtree of every node to the 
weight of the subtree rooted at the node is between some fraction a and | — a, the 
tree is a weight-balanced tree of ratio a or is said to be in the class wh[a}. When an 
ordinary insertion or deletion on a tree in class wba] would remove the tree from the 
class, rotations are used to restore the weight-balanced. property. 

Another type of tree, called a balanced binary tree by Tarjan, requires that for 
every node nd, the length of the longest path from nd to an external node is at most 
twice the length of the shortest path from nd to an external node. (Recall that external 
nodes are nodes added. to the tree at every null pointer.) Again, rotations are used to 
maintain balance after insertion or deletion. Tarjan’s balanced trees have the property 
that at most one double and one single rotation restore balance after either insertion or 
deletion, as opposed to a possible O(log n) rotations for deletion in a height-balance 

Balanced trees may also be used for efficient implementation of priority queue 
(see Sections 4.1 and 6.3). Inserting a new element requires at most O(log n) steps 
to find its position and O(1) steps to access the element (by following left pointers to 
the leftmost leaf) and O(log n) or O(1) steps to'delete that leaf. Thus. like a priority 
queue implemented using a heap (Section 6.3), a priority queue implemented using 


a balanced tree can perform any sequence of n insertions and minimum deletions in 
O(n log n) steps. 


EXERCISES 


7.2.1. Write an efficient insertion algorithm for a binary search tree to insert a new record 
whose key is known not to exist in the tree. : 


7.2.2. Show that it is possible to obtain a binary search tree in which only a single leaf exists 
even if the c.ements of the tree are not insérted in strictly ascending or descending 
order. d 

7.2.3. — Ve 


erify by simulation that if records are presented to the binary tree séarch and insertion 
algorithm in random order, the number of key comparisons is O(log n). 


- 
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(a) 


(c) 


Figure 7.2.8 


7.2.4, Prove that every n-node binary search tree is not equally likely (assuming items arc 


inserted in random order), and that balanced trees are more probable than straight-line 
trees. 


7.2.5. Write an algorithm to delete a node from a binary tree that replaces the node with its 
inorder predecessor, rather than its inorder successor. 


7.2.6. Suppose that the node type of a binary search tree is defined as follows: 


struct nodetype { 
int k; 
int r; 
Struct nodetype *left; 
Struct nodetype *right; 
} Š 


The k and r fields contain the key and record of the node; left and right are pointers to 
the node’s sons. Write a C routine sinsert(tree, key, rec) to search and insert a record 
rec with key key into a binary search tree pointed to by tree. 
7.2.7. Write a C routine sdelete(tree, key) to search and delete a record record with key key 
from a binary search tree implemented as in the previous exercise. If such a record 
is found, the function returns the value of its r field; if it is not found. the function 
returns 0. ee A ; 
7.2.8. Write a C routine delete(tree, kev), key2) to delete all records with keys between key! 
and key? (inclusive) from a binary search tree whose nodes are declared as in the pre- 
vious enercises. 
7.2.9. Consider the search trees of Figure 7.2.8. í 
(ay How many permutations of the integers 1 through 7 would produce the binary 
search trees of Figure 7.2.8a, b, and c, respectively? 

(b) How many permutations of the integers 1 through 7 produce binary search trees 
similar to the trees of Figure 7.2.8a, b, and c, respectively? (See Exercise 5.1.9.) 

(c) How many permutations of the integers 1 through 7 produce binary search trees 
with the same number of nodes at each level as the trees of Figure 7.2.8a. b. and 
c, respectively? 

(d) Find an assignment of probabilities to the first seven positive integers as search 
arguments that makes each of the trees of Figure 7.2.8a. b, and c optimum. 

7.2.10. Show that the Fibonacci tree of order h + 1 (see Exercise 5.3.5) is a height-balanced 


tree of height h and has fewer nodes than does any other height balanced tree of 
height A. 


7.3 GENERAL SEARCH TREES 


` General nonbinary trees are also used as search tables, particularly in external storage. 
There are two broad categories of such trees: multiway’search trees and digital search 
trees. We examine each in turn. 
Multiway Searc') Trees 


In a binary search tree, each node nd contains a single key and points to nwo 
subtrees. One of these subtrees contains all the keys in the tree rooted at nd that are less 
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than the key in nd, and the other subtree contains all the keys in the tree rooted at nd 
that are greater than (or equal to) the key in nd. 

We may extend this concept to a general search tree in which each node contains 
one or more keys. A multiway search tree of order nis a general tree in which each 
node has n or fewer subtrees and contains one fewer key than it has subtrees. That is, 
if a node has four subtrees, it contains three keys. In addition, if so, 5},..., Sm-] are 
the m subtrees of a node containing keys ko, kis... , km-2 in ascending order, all keys 
in subtree so are less than or equal to kg, all keys in the subtree s j (where j is between 
l and m — 2) are greater than kj-; and less than or equal to kj, and all keys in the 
subtree Sm-; are greater than km-2. The subtree s j is called the left subtree of key k j 
and its root is called the left son of key k j+ Similarly, sj is called the right subtrze, 
and its root the right'son, of key kj-,. One or more of the subtrees of a node may be 
empty. (Sometimes, the term “multiway search tree” is used to refer to any nonbinary 
tree used for searching, including the digital trees that we introduce at the end of this 
section. However, we use the term strictly for trees that can contain complete keys in 
each node.) 

Figure 7.3.1 illustrates a number of multiway search trees. Figure 7.3.la is a mul- 
tiway search tree of order 4. The eight nodes of that tree have been labeled A through H. 
Nodes A, D, E; and G contain the maximum number of subtrees, 4, and the maximum 
number of keys, -3. Such nodes are called full nodes. However, some of the subtrees 
of nodes D and E and all of the subtrees of node G are empty, as indicated by arrows 
emanating from the appropriate positions in the nodes. Nodes B, C, F, and H are not full 
and also contain some empty subtrees. The first subtree of A contains the keys 6 and 
10, both smaller than 12, which is the first key of A. The second subtree of A contains 
25 and 37, both greater than 12 (the first key of A) and less than 50 (the second key). 
The third subtree contains 60, 70, 80, 62, 65, and 69, all of which are between 50 (the 
second key of A) and 85 (the third key). Finally, the last subtree of A contains 100, 120, 
150, and 110, all greater than 85, the last key in node A. Similarly, each subtree of any 
other node contains only keys between the appropriate two keys of that node and its 
ancestors. 

Figure 7.3.1b illustrates a top-down multiway search tree of order 3. Such a tree 
is characterized by the condition that any nonfull node is a leaf. Note that the tree of 
Figure 7.3.1a is not top-down, since node C is nonfull, yet contains a nonempty subtree. 
Define a semileaf as a node with at least one empty subtree. In Figure 7.3.1a, nodes B 
through H are all semileaves. In Figure 7.3.1b, nodes B through G and J through R are 
semileaves. In a top-down multiway tree. a semileaf must be either full or a leaf. 

Figure 7.3.1 is yet another multiway search tree of order 3. It is not top-down, 
since there are four nodes with only one key and nonempty subtrees. However, it does 
have another special property in that it is balanced. That is, all its semileaves are at the 
same level (3). This implies that all semileaves are leaves. Neither the tree of Figure 
7.3.1a (which has leaves at levels 1 and 2) nor that of Figure 7.3.1b (leaves at levels 2, 
3, and 4) are balanced multiway search trees. (Note that although a binary search tree 
is a multiway search tree of order 2, a balanced binary search tree as defined at the end 
of Section 7.2 is not necessarily balanced as a multiway search tree, since it can have 
leaves at different levels.) 
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Searching a Multiway Tree i 

The algorithm to search a multiway search tree, regardless ọf whether it is top- 
down, balanced, or neither, is straightforward. Each node contains a single integer 
field, a variable number of pointer fields, and a variable number of key fields. If 
node(p) is a node, the integer field numtrees(p) equals the number of subtrees of 
_ node(p). numtrees(p) is always less than or equal to the order of the tree, n. The 
pointer fields son(p,0) through son(p, numtrees(p) — 1) point to the subtrees of 
node(p). The key fields k(p,0) through k(p, numtrees(p) — 2) are the keys contained in 
node(p) in ascending order. The subtree to which son(p,i) points (for i between 1 and 
numtrees(p) — 2 inclusive) contains all keys in the tree between k(p,i — 1) and k(p,i). 
son(p,0) points to a subtree containing only keys less than k(p,0) and son(p, numtrees(p) 
— 1) points to a subtree containing only keys greater than k(p, numtrees(p) — 2). 

We also assume a function nodesearch(p, key) that returns the smallest integer j 
such that key <= k(p,j), or numtrees(p) — 1 if key is greater than all the keys in node(p). 
(We will discuss shortly how nodesearch is implemented.) The following recursive 
algorithm is for a function search(tree) that returns a pointer to the node containing 
key.(or — 1 [representing null] if there is no such node in the tree) and sets the global 
variable position to the position of key in that node: 


p = tree; 

if (p == null) { 
position = -1; 
return(-1); 

} /* end if */ 

i = nodesearch(p, key); 

if (i < numtrees(p) - 1°@& key == k(p,i)) { 
position = 7; 
return(p); 

} /* end if */ 3 

return(search(son(p,i))); 


Note that after setting i to nodesearch(p, key), we insist on checking that 
i < numtrees(p) — 1 before accessing k(p,i). This is to avoid using the possibly nonex- 
istent or erroneous k(p, numtrees(p) — 1), in case key is greater than all the keys in 
node(p). The following is a nonrecursive version of the foregoing algorithm: 


p = tree; 
while (p "= null) { 
/* search the subtree rooted at node(p) */ 
i = nodesearch(p, key); 
if (i < numtrees(p) - 1 & key == k(p,i)) { 
... position = i; 
return(p); 
} /* end-if */ 
p = son(p,i); 
} /* end while */ 
position = -1; 
return(-1); 
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The function nodesearch is responsible for locating the smallest key in a node 
greater than or equal to the search argument. The simplest technique for doing this 
is a sequential search through the ordered set of keys in the node. If all keys are of 
fixed equal length, a binary search can also be used to tocate the appropriate key. The 
decision whether to use a sequential or binary search depends on the order of the tree, 
which determines how many keys must be searched. Another Possibility is to organize 
the keys within the node as a binary search tree. 


Implementing a Multiway Tree 


Note that we have implemented a multiway search tree of order n using nodes 
with up to sons rather than as a binary tree with son and brother pointers, as outlined 
in Section 5.5 for general trees. The reason for this is that in multiway trees, unlike in 
general trees, there is a limit to the number of sons of a node, and we can expect most 
nodes to be as full as possible. In a general tree there was no such limit, and many nodes 
might contain only one or two items. Therefore the flexibility of allowing as many or as 
few items in a node as necessary and the space saving when a node was nearly empty 
was worth the overhead of extra brother pointers. 

` Nevertheless, when nodes are not full, multiway search trees as implemented here 
do waste considerable storage. Despite this possible waste of storage, multiway trees 
are frequently used, especially to store data on an external direct-access device such 
as a disk. The reason for this is that accessing each new node during a search requires 
reading a block of storage from the external device. This read operation is relatively 
expensive in terms of time because of the mechanical work involved in positioning the 
device properly. However, once the device is positioned, the task of actually reading 
a large amount of sequential data is relatively fast. This means that the total time for 
reading a storage block (that is, a “node”) is Only minimally affected by its size. Once a 
node has been read and is contained in internal computer memory, the cost of searching 
it at electronic internal speeds is minuscule compared with the cost of initially reading it 
into memory. In addition, external storage is fairly inexpensive so that a technique that 
improves time-efficiency at the expense of external Storage space utilization is real-cost 
(that is, dollars) effective. For this reason, external storage systems based on multiway 
search trees try to maximize the size of each node, and trees of order 200 or more are 
not uncommon. 

The second factor to consider in implementing multiway search trees is storage of 
the data records themselves. As in any Storage system, the records may be stored with 
the keys or remotely from the keys. The first technique requires keeping entire records 
within the tree nodes, whereas the second requires keeping a pointer to the associated 
record with each key in a node. (Still another technique involves duplicating keys and 
keeping records only at the leaves. This mechanism is discussed later in more detail 
when we discuss B*-trees.) 

In general we would like tu keep as many keys as possible in each node. To see 
why this is so, consider two top-down trees with 4000 keys and minimum depth, one 
of order, $ and the othe: of order 11. The order-5 tree requires 1000 nodes (of 4 keys 
each) to hold the 4000 keys. whereas the order-1] tree requires only 400 nodes (of 10 
keys each). The depth of the order-5 tree is at least 5 (level 0 contains | node, level | 
contains 5, level 2 contains 25, level 3 contains 125. level 4 contains 625, and level 5 
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contains the remaining 219), whereas the depth of the order-11 tree can be as low as 3 
(level 0 contains 1 node, level 1 contains 11, level 2 contains 121, and level 3 contains 
the remaining 267). Thus 5 or 6 nodes must be accessed in searching the order-5 tree 
for most of the keys, but only 3 or 4 nodes must be accessed for the order-11 tree. 
But as we noted above, accessing a node is the most expensive operation in searching 
external storage, where multiway trees are most used. Thus a tree with a higher order 
leads to a more efficient search process. The actual storage required by both situations 
is approximately the same, since, although fewer large nodes are required to hold a file 
of a given size when the order is high, each node is larger. 

Since the size of a node is usually fixed by other external factors (for example, 
the amount of storage physicu!ly read from disk in one operation), a higher order tree 
is obtained by keeping the records outside the tree nodes. Even if this causes an extra 
external read to obtain a record after its key has been located, keeping records within a 
node typically reduces the order by a factor of between 5 and 40 (which is the typical 
range of the ratio of record size to key size), so the trade-off is not worthwhile. 

If a multiway search tree is maintained in external storage, a pointer to a node is an 
external storage address that specifies the starting point of a storage block. The block of 
storage that makes up a node must be read into internal storage before any of the fields 
numtrees, k, or son can be accessed. Assume that the routine directread( (p, block) reads 
a node at external storage address p into an internal storage buffer block. Assume also 
that the numtrecs, k, and son fields in the buffer are accessed by the C-like notations 
block.numtrees, block.k, and block.son. Assume also that the function nodesearch is 
modified to acceptan (internal) storage block rather than a pointer [that is, it is invoked 
by nodesearch(block, key) rather than by nodesearch(p, key)). Then the following is a 
nonrecursive algorithm for searching an externally stored multiway search tree: 


p = tree; 
while (p != null) { 
directread(p, block); 
i = nodesearch(block, key); 
if (i < block.numtrees - 1 & key == block.son(i)) { 
position = i; 
return(p); 
} /* end if */ 
p = block.son(i); 
J /* end while */ 
position = -1; 
return(-1); 


The algorithm aiso sets block to the node at external address p. The record associated 
with key or a pointer to it may be found in block. Note that null as used in this algorithm 
references a null external storage address rather than the C pointer null. 


“Traversing a Multiway Tree 
A common operation on a data structure is traversal: accessing all the elements of 


the structure in a fixed sequence. The following is a recursive algorithm traverse(tree) 
to traverse a multiway tree and print its keys in ascending order 
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if (tree != null) { 
nt = numtrees(tree); 
for (i = 0; i < nt - 1; i+) { 
traverse(son‘tree, i)); 
printf("Xd", k(tree, i)); 
} /* end for */ 
traverse(son(tree, nt)); 
} /* end if */ 


In implementing the recursion, we must keep a stack of pointers to all the nodes in a 
path beginning with the root of the tree down to the node currently being visited: 

If each node is a block of external Storage and tree is the root node’s external 
Storage address, a node must be read into internal memory before its son or k fields can 
be accessed. Thus the algorithm becomes 


if (tree != null) { 
directread(tree, block); 
nt = block.numtrees; 
for (i = 0; i < nt - 1; i++) { 
traverse(block.son(i)); 
printf("%d", block.k(i)); 
} /* end for */ 
traverse(block.son(nt)); 
} /* end if */ 


where directread is a system routine that reads a block of storage at a particular ex- - 
ternal address (tree) into an internal memory buffer (block). This requires keeping a 
stack of buffers as well. If d is the depth of the tree, d + 1 buffers must be kept in 
memory. 

Alternatively, all but one of the buffers can be eliminated if each node contains 
two additional fields: a father field pointing to its father, and an index field indicating 
which son of its father the node is. Then when the last subtree of a node has been 
traversed, the algorithm uses the father field of the node to access its father and its 
index field to determine which key in the father node to output and which subtree 
of the father to traverse next. However, this would require numerous reads for each 
node and is probably not worth the savings in buffer space, especially since a high- 
order tree with a very large number of keys requires very low depth. (As-previously 
illustrated, a tree of order 11 with 4000 keys can be accommodated comfortably with 
depth 3. A tree of order 101 can accommodate over a million keys with a depth 
of only 2.) 

Another common operation, closely related to traversal. is direct sequential ac- 
cess. This refers to ac ‘essing the next key following a key whose location in the tree is 
known. Let us assume : , it we have located a key k1 by searching the tree and that it is 
located at position k(1, i1), Ordinarily. the successor of 41 can be found by executing 
the following routine next(nl. i1). Gnillkey is a special value indicating that a proper 
key cannot be found.) 
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p = son(nl, il + 1); a 
q = null; /* qis one node behind p */ 
while (p != null) { 
q = p; 
p = son(p, 0); 
} /* end while */ 
if (q != null) 
return(k(q, 0)); 
if (il < numtrees(n1) - 2) 
return(k(nl, il + 1)); 
return(nullkey); 


This algorithm relies on the fact that the successor of k1 is the first key in the 
subtree that follows k1 in node(n1), or if that subtree is empty [son(n1,i1 + 1) equals 
null) and if k1 is not the last key in its node (il < numtrees(n1) — 2), the successor is 
the next key in node(n1). 

However, if k1 is the last key in its node and if the subtree following it is empty, 
its successor can only be found by backing up the tree. Assuming father and index 
fields in each node as outlined previously, a complete algorithm successor(n1,il) to 


find the successor of the key in position i1 of the node pointed to by n1 may be written 
as follows: , 


p = son(nl, ii +1); 

if (p != null il < numtrees(n1) - 2) 
/* use the previous algorithm */ 
return(next(nl, i1)); 

f = father(nl); 

i = index(nl); 

while (f != nul] & i == numtrees(f) - 1) { 
i = index(f); 
f = father(f); 

} /* end while */ 

if (f == nu) 
return(NULLKEY); 

return(k(f, 7)); 


Of course, we would like to avoid backing up the tree whenever possible. Since 
a traversa] beginning at a specific key is quite common, the initial search process is 
often modified to retain, in internal memory, all nodes in the path from the tree root 
to the located key. Then, if the tree must be backed up, the path to the root is readily 
available. As noted previously, if this is done, the fields father and index are not needed. 

Later in this section, we examine a specialized adaptation of a multiway search 
tree, called a B*-tree, that does not require a stack for efficient sequential traversal. 


Insertion in a Multiway Search Tree 


Now that we have examined how to search and traverse multiway search trees, 
let us examine insertion techniques for these structures. We examine two insertion tech- 
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niques for multiway search trees. The first is analogous to binary search tree insertion 
and results in a top-down multiway search tree. The second is a new insertion tech- 
nique and produces a special kind of balanced multiway search tree. It is this second 
technique, or a slight variation thereof, that is most commonly used in direct-access 
external file storage systems. a 

For convenience we assume that duplicate keys are not permitted in the tree, so 
that if the argument key is found in the tree no insertion takes place. We also assume 
that the tree is nonempty. The first step in both insertion procedures is to search for the 
argument key. If the argument key is found in the tree, we return a pointer to the node 
containing the key and set the variable position to its position within the node, justas in 
the search procedure presented previously. However, if the argument key is not found, 
we return a pointer to the semileaf node(s) that would contain the key if it were present, 
and positión is set to the index of the smallest key in node(s) that is greater than the 
argument key (that is, the position of the argument key if it were in the tree). If all the 
keys in node(s) are less than the argument key, position is set to numtrees(s) — 1. A 
variable, found, is set to true or false depending on whether the argument key is or is 
not found in the tree. 

Figure 7.3.2 illustrates the result of this procedure for an order-4 top-down bal- 


anced multiway search tree and several search arguments. The algorithm for find is 
straightforward: 


q = null; 
p = tree; 
while (p != null) { 
i = nodesearch(p, key); 
q = P; 
if (i < numtrees(p) - 1 & key == k(p, i)) { 
found = TRUE; 
position = 7; 
return(p); /* the key is found in node(p) */ 
} /* end if */ ; 
p = son(p, i); 
} /* end while */ 
found = FALSE; 
position = 7; 
return(q); /* pis null. q points to a semileaf */ 


To implement this algorithm in C, we would write a function find with the fol- 
lowing header:. 


NODEPTR find(NODEPTR tree, KEYTYPE key, int *pposition, int *pfound) 


References to position and ‘found in the algorithm are replaced by references to 
*pposition and *pfour:-’. respectively, in the C function. 

Let us assume that s is the node pointer returned by find. The second Step of the - 
insertion procedure applies only if the key is not found (remember, no duplicate keys 
are permitted) and if node(s) is not full (that is, if numtrees(s) < n, where n is the order 
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of the tree). In Figure 7.3.2; this applies to cases d and f only. The second step consists 
of inserting the new key (and record) into node(s). Note that if the tree is top-down or 
balanced, a nonfull semileaf discovered by find is always a leaf. Let insrec(p, i, rec) be 
a routine to insert the record rec in position i of node(p) as appropriate: Then the second 
step of the insertion process may be described as follows: ‘ 


nt = .numtrees(s); 

numtrees(s) = nt + 1; 

for (i = nt - 1; i > position; i--) 
kls, i) = k(s, 7 - 1); 

k(s, position) = key; 

insrec(s, position, rec); 


We call this function insleaf (s, position, key, rec). 

Figure 7.3.3a illustrates the nodes located by the find procedure in Figure 7.3.2d 
and f with the new keys inserted. Note that it is unnecessary to copy the son pointers 
associated with the keys being moved, since the node a: a leaf, so that all pointers are 
null. We assume that they were initialized to null when the node was initially added to 
the tree. 

If step 2 is appropriate (that is, if a nonfull leaf node where the key can be inserted 
has been found), both insertion routines terminate. The two techniques differ only in 
the third step, which is invoked when the find procedure locates a full semileaf. 

The first insertion technique, which results in top-down multiway search trees,’ 
mimics the actions of the binary search tree insertion algorithm: That is, it allocates a 
new node, inserts the key and record into the new node, and places the new node as the 
appropriate son of node(s). It uses the routine maketree(key, rec) to allocate a node, set 
the n pointers in it to null, its numtrees field to 2, and its first key field to key. maketree 
then calls insrec to insert the record’ as appropriate and finally returns a pointer to the 
newly allocated node. Using maketree, the routine insfull to insert the key when the 
appropriate semileaf is full may be implemented trivially as 


p = maketree(key, rec); 
son(s, position) = p; 


If father and index fields are maintained in each node, the operations 


father(p) = s; 
index(p) = position; 


are required as well. 

Figure 7.3.3b illustrates the result of inserting keys 71 and 22, respectively, into 
the nodes located by find in Figure 7.3.2c and e. Figure 7.3.3c illustrates subsequent 
insertions of keys 86, 77. 87, 84, 85, and 73, in that order. Note that the order in which 
keys are inserted very much affects where the keys are placed. For example, consider 
what would happen if the keys were inserted in the order 85, 77, 86, 87, 73, 84. 

Note also that this insertion technique can transform a leaf into a nonleaf (although 
it remains a semileaf) and therefore unbalances the multiway tree. It is therefore possi- 
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ble for successive insertions to produce a tree that is heavily unbalanced and in which 
an inordinate number of nodes must be accessed to locate certain keys. In practical 
situations, however, multiway search trees created by this insertion technique, althoùgh 
not completely balanced, are not too greatly unbalanced, so that too many nodes are not 
accessed in searching for a key in a leaf. However, the technique does have one major 
drawback. Since leaves are created containing only one key, and other leaves may be 
created before previously created leaves are filled, multiway trees created by successive 
insertions in this manner waste much space with leaf nodes that are nearly empty. 
Although this insertion method does not guarantee balanced trees, it does guar- 
antee top-down trees. To see this, note that a new node is not created unless its father 


Searching Chap. 7 
434 


_ is full. Thus any nonfull node has no descendants and is therefore a leaf, which by 

definition implies that the tree is top-down. The advantage of a top-down tree is that 
the upper nodes are full, so that as many keys as possible are found on short paths. 

Before examining the second insertion technique, we put all the pieces of the 


first technique together to form a complete search and insertion algorithm for top-down 
multiway search trees. 


if (tree == null) { 
tree = maketree(key, rec); 
position = 0; 
return (tree); 
.} /* end if */ 
£ = find(tree, key, position, found); 
if (found == TRUE) 
return (s); 
if (numtrees(s) < n) { 
insleaf(s, position, key, rec); 
return(s); 
} /* end if */ 
p = maketree(key, rec); 
son(s, position) = p; 
position = 0; 
return (p); 


B-Trees 


The second insertion technique for multiway search trees is more complex. Com- 
pensating for this complexity, however, is the fact that it creates balanced trees, so that 
the maximum number of nodes accessed to find any particular key is kept small. In 
addition, the technique yields one further bonus, in that all nodes (except for the root) 
in a tree created by this technique are at least half full, so. that very little storage space 
is wasted. This last advantage is the primary reason that the second insertion technique 
(or a variation thereof) is used so fréquently in actual file systems. 

A balanced order-n multiway search tree in which each nonroot node contains at 
least (n—1)/2 keys is called a B-tree of order n. (Note that the slash denotes integer 
division so that a B-tree of order 12 contains at least 5 keys in each nonroot node, as 
does a B-tree of order 11.) A B-tree of order n is also called an n-(n — 1) tree or an 
(n — 1)-n tree. (The dash outside the parentheses is a hyphen while the dash inside 
the parentheses is a minus sign.) This reflects the fact that each node in the tree has 
a maximum of n — 1 keys and n sons. Thus, a 4-5 tree is a B-tree of order 5, as is 
a 5-4 tree. In particular, a 2-3 (or 3-2) tree is the most elementary nontrivial (that is, 
nonbinary) B-tree, with one or two keys per node and two or three sons per node. 

(At this point, we should say something about terminology. In discussing B-trees, 
the word “order” is used ..:fferently by different authors. It is common to find the order 
of a B-tree defined as the minimum number of keys in a nonroot node [that is, (n—1)/2], 
and the degree of a B-tree to mean the maximum number of sons [that is, n]. Still other 
authors use “order” to mean the maximum number of keys in a node [that is, n — 1). 
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We use order consistently for all multiway search trees to mean the maximum number 
of sons.) 

- The first two steps of the insertion technique are the same for B-trees as for top- 
down trees. First, use find to locate the leaf into which the key should be inserted, and 
second, if the located leaf is not full, add the key using insleaf. It is in the third step, 
when the located leaf is found to be full, that the methods differ. Instead of creating a 
new node with only one key, split the full leaf in two: a left leaf and a right leaf. For 
simplicity, assume that n is odd. The n keys consisting of the n — 1 keys in the full 
leaf and the new key to be inserted are divided into three groups: the lowest n/2 keys 
are placed into the left leaf, the highest n/2 keys are placed into the right leaf, and the 
middle key [there must be a middle key since 2 * (n/2) equals n — 1 if n is odd] is 
placed into the father node if possible (that is, if the father node is not full). The two 
pointers on either side of the key inserted into the father are set to the newly created 
left and right leaves, respectively. 

Figure 7.3.4 illustrates this process on a B-tree of order 5. Figure 7.3.4a shows 
a subtree of a B-tree, and Figure 7.3.4b shows part of the same subtree as it is altered 
by the insertion of 382. The leftmost leaf was already full, so that the five keys 380, 
382, 395, 406, and 412 are divided so that 380 and 382 are placed in a new left leaf, 
406 and 412 are placed in a new right leaf, and the middle key, 395, is advanced to the 
father node with pointers to the left and ri ght leaves on either side. There is no problem 
placing 395 in the father node, since it contained only two keys and has‘room for four. 

Figure 7.3.4c shows the same subtree with first 518 and then 508 inserted. (The 
same result would be achieved if they were inserted in reverse order.) It is possible 
to insert 518 directly in the rightmost leaf, since there is room for one additional key. 
However, when 508 arrives, the leaf is already full. The five keys 493, 506, 508, 511, 
and 518 are divided so that the lower two (493 and 506) are placed in a new left leaf, 
the higher two (511 and 518) in a new right leaf, and the middle key (508) is advanced 
to the father, which still hag room to accommodate it. Note that the key advanced to 
the father is always the middle key, regardless of whether it arrived before or after the 
other keys. 

If the order of the B-tree is even, the n — 1 keys (excluding the middle key) 
must be divided into two unequal-sized groups: one of size n/2 and the other of size 
(n — 1)/2. [The second group is always of-size (n — 1)/2. regardless of whether n is 
odd or even, since when n is odd, (n — 1)/2 equals n/2.] For example, if n equals 10, 

_ 10/2 (or 5) keys are in one group, 9/2 (or 4) keys are in the other group, and one key 
is advanced, for a total of 10 keys. These may be divided so that the larger-sized group 
is always in the left leaf or the right leaf, or divisions may be alternated so that on one 
split, the right leaf contains more keys and on the next split, the left leaf contains more 
keys. In practice, it makes little difference which technique is used. 

Figure 7.3.5 illustrates both left and right biases in a B-tree of order 4. Note that 
whether ‘a-left or right bias is chosen determines which key is to be advanced to the 
father. 

One basis on which the decision can be made as to whether to leave more keys 
in the left or right leaf is to examine the key ranges under both possibilities. In Figure 
7.3.5b, utilizing a left bias, the key range of the left node is 87 to 102, or 15, and the 
key range of the right node is 102 to 140, or 38. in Figure 7.3.5c, utilizing a right bias, 
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Figure 7.3.5 


the key ranges are 13 (87 to 100) and 40 (100 to 140). Thus we would select a left bias 
in this case, since it more nearly equalizes the probability of a new key going into the 
left and right, assuming a uniform distribution of keys. 

The entire discussion thus far has assumed that there is room in the father for the 
middle node to be inserted. What if the father node, too, is full? For example, what 
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happens with the insertion of Figure 7.3.2c, where 71 must be inserted into the full node 
L, and C, the father of L, is full as well? The solution is quite simple. The father node 
is also split in the same way, and its middle node is advanced to its father. This process 
continues until a key is inserted in a node with room, or the root node, A, itself is split. 
When that happens, a new root node NR is created containing the key advanced from 
the splitting of A, and with the two halves of A as sons. 

Figures 7.3.6 and 7.3.7 illustrate this process with the insertions of Figure 7.3.2c 
and e. In Figure 7.3.6a, node L is split. (We assume a left bias throughout this illustra- 
tion.) The middle element (75) should be advanced to C, but C is full. Thus, in Figure 
7.3.6b, we see C being split as well. The two halves of L are now made sons of the 
appropriate halves of C. 25, which had been advanced to C, must now be advanced to 
root node A, which also has no room. Thus, A itself must be split, as shown in Figure 
7.3.6c. Finally, Figure 7.3.6d shows a new root node, NR, established, containing the 
key advanced from A and two pointers to the two halves of A. 

Figure 7.3.7 illustrates the subsequent insertion of 22, as in Figure 7.3.2e. In that 
figure, 22 would have caused a split of nodes G, B, and A. But, in the meantime, A has 
already been split by the insertion of 71, so the insertion of 22 proceeds as in Figure 
7.3.7. First G is split and 20 is advanced to B (Figure 7.3.7a), which is split in turn 
(Figure 7.3.7b). 25 is then advanced to Al, which is the new father of B. But since 
A1 has room, no furthur splits are necessary. 25 is inserted into A! and the insertion is 
complete (Figure 7.3.7c). 

As a final illustration, Figure 7.3.8 shows the insertion of several keys into the 
order-5 B-tree of Figure 7.3.4c. It would be worthwhile for you to generate a list of 
keys and continually insert them into an order-5 B-tree to see how it develops. 

Note that a B-tree grows in depth through the splitting of the root and the cre- 
ation of a new root, and in width by the splitting of nodes. Thus B-tree insertion into 
a balanced tree keeps the tree balanced. However, a B-tree is rarely top-down. since 
when a full nonleaf node splits, the two nonleaves created are not full. Thus, while the 
maximum number of accesses to find a key is low (since the tree is balanced), the aver- 
age number of such accesses may be higher than in a top-down tree in which the upper 
levels are always full. In simulations, the average number of accesses in searching a 
random top-down tree has indeed been found slightly lower than in searching a random 
B-tree because random top-down trees are generally fairly balanced. 

One other point to note in a B-tree is that older keys (those inserted first) tend to 
be closer to the root than younger keys, since they have had more opportunity to be ad- 
vanced. However, it is possible for a key to remain in a leaf forever even if a large num- 
ber of locally lower and higher keys are subsequently inserted. This is unlike a top-down 
tree in which a key in an ancestor node must be older than any key in a descendant node. 


Algorithms for B-Tree insertion 


As you might imagine, the algorithm for B-tree insertion is fairly involved. 
To simplify matters +- «porarily, let us assume that-we can access a pointer to the 
father of node(nd) by referring to father(nd) and the position of the pointer nd in 
node(father(nd)) by index(nd ), sothat son( father(nd ), index(nd)) equals nd. (This can 
be implemented most directly by adding father and index fields to each node, but there 
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are complications to such an approach that we discuss shortly.) We also assume that 
r(p,i) is a pointer to the record associated with the key k(p,i). Recall that the routine find 
returns a pointer to the leaf in which the key should be inserted and sets the variable 
position to the position in the leaf at which the key should be inserted. Recall also that 
key and rec are the argument key and record to be inserted. 

The function insert(key, rec, s, position) inserts a record into a B-tree. It is called 
following a call to find if the output parameter found of that routine is false (that is, the 
key is not already in the tree), where the parameter s has been set to the node pointer 
returned by, find (that is, the node where key and rec should be inserted). The routine 
uses two additional auxiliary routines, which will be presented shortly. The first routine, 
split accepts five input parameters: nd, a pointer to a node to be split; pos, the position 
in node(nd) where a key and record are to be inserted; newkey and newrec, the key and 
record being inserted (this key and record might be the ones being advanced from a 
previously split node or might be the new key and record being inserted into the tree): 
and newnode, a pointer to the subtree that contains the keys greater than newkey (that 
is, a pointer to the right half of a node previously split), which must be inserted into 
the node currently being split. To maintain the proper number of sons within a node, 
each time that a new key and record are inserted into a node, a new son pointer must 
be inserted as well. When a new key and record are inserted into a leaf, the son pointer 
inserted is null. Because a key and record are inserted into one of the upper levels only 
when a node is split at a lower level, the new son pointer to be inse.ted (newnode) will be 
to the right half of the node that was split at the lower level. The left half remains intact 
within the previously allocated lower-level node: split arranges newkey and the keys 
of node(nd) so that the group of n/2 smallest keys remain in node(nd), the middle key 
and record are assigned to the output parameters midkey and midrec, and the remaining 
keys are inserted into a new node, node(nd2), where nd2 is also an output parameter. 

The second routine, insnode, inserts the key newkey and the record newrec and 
the subtree pointed to by newnode into node(nd) at position pos if there is room. Recall 
that the routine maketree(key, rec) creates a new node containing the single key key and 
the record rec and all pointers null. maketree returns a pointer to the newly created node. 
We present the algorithm for insert using the routines split, insnode, ana maketree. 


nd = s$; 
pos = position; ` 
newnode = null; /* pointer to right half of split node */ 
newrec = rec; i the record to be inserted ey 
newkey = key; /* the key to be inserted gd 
f = father(nd); 
while (f != null & nurtree(nd) == n) { 

split(nd, pos, newkey. newrec, newnode, nd2, midkey, midrec); 

newnode = nd2; 

pos = index(nd); 

nd = f;. 

f = father(nc); 

newkey = midkey; 

newrec = midrec; 
} /* end while */ 
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if (numtrees(nd) < n) { | 

insnode(nd, pos, newkey, newrec, newnode); 

return; 
} /* end if */ 
/* f equals null and numtrees(nd) equals n so that nd is- */ 
i* a full root; split it and create a new root $/ 
split(nd, pos, newkey, newrec, newnode, nd2, midkey, midrec); 
tree = maketree(midkey, midrec); 
son(tree, 0) = nd; 
son(tree, 1) = nd; 


The heart of this algorithm is the routine split that actually splits a node. split 
itself uses an auxiliary routine copy(nd1, first, last, nd2), which sets a local vari- 
able numkeys to last — first + 1 and copies fields k(nd1, first) through k(nd1, last) 
into k(nd2,0) through k(nd2, numkeys), fields r(nd1, first) through r(nd1, last) (which 
contains pointers to the actual records) into r(nd2,0) through r(nd2, numkeys), and 
fields son(nd1, first) through son(nd1, last + 1) into son(nd2,0) through son(nd2, 
numkeys + 1). copy also sets numtrees(nd2) to numtrees + 1. If last < first, copy sets 
numtrees(nd2) to 1 but dogs not change any k, r,,or son fields. split also uses getnode 
to create a new node and insnode to insert a new key in a nonfull node. 

The following is an algorithm for split. The n keys contained in node(nd) and 
newkey must be distributed so that the smallest n/2 remain in node(nd), the higl-est 
(n — 1)/2 (which equals n/2 if n is odd) are placed in a new node, node(nd2), and the 
middle key is placed in midkey. To avoid recomputing n/2 on each occasion, assume 
that its value has been-assigned to the global variable ndiv2. The input value pos is the 
position in node(nd) in whith-newkey would be placed if there were room. 


/* create a new node for the right half; keep */ 
/* “the first half in node(nd) . */ 
nd2 = getnode(); 
if (pos > ndiv2) { 
/* newkey belongs to node(nd2) */ 
copy(nd, ndiv2 + 1, n - 2, nd2); 
insnode(nd2, pos - ndiv2 - 1, newkey, newrec, newnode) ; 
numtrees(nd) = ndiv2 + 1; 
midkey = k(nd, ndiv2); 
midrec = r(nd, ndiv2); 
return; i 
} /* end if */ 
if (pos == ndiv2) { 
/* newkey is the middle key */ 
.copy(nd, ndiv2, n - 2, nd2); 
numtrees(nd) = ndiv2 + 1; 
son(nd2, 0) = newnode; 
midkey = newkey; 
midrec = -newrec; 
return; 
} /* end if */ 
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if (pos < ndiv- 
/* newkey veivngs in node(nd) */ 
copy(nd, ndiv2, n - 2, nd2); 
numtrees(nd) = ndiv2; 
insnode(nd, pos, newkey, newrec, newnode); 
midkey = k(nd, ndiv2 - 1); 
midrec = r(nd, ndiv2 - 1); 
return; 
} /* end if */ 


The routine insnode(nd, pos, newkey, newrec, newnode) inserts a new record 
newrec with key newkey into position pos of a nonfull node, node(nd). newnode points 
to a subtree to be inserted to the right of the new record. The remaining keys and sub- 
trees in positions pos or greater are moved up one position. The value of numitrees(nd ) 
is increased by 1. An algorithm for insnode follows: 


for (i = numtrees(nd) - 1; i >= pos + 1; i--) { 
son(nd, i + 1) = son(nd, i); 
k(nd, i) = k(nd, i - 1); ` 
r(nd, i) = r(nd, i - 1); 

} /* end for */ 

son(nd, pos + 1) = newnode; 

k(nd, pos) = newkey; 

r(nd, pos) = newrec; 

numtrees(nd) += 1; 


Computing father and index | 


Before examining the efficiency of the insertion procedure, we must clear up one 
outstanding issue: the matter of the father and index functions. You may have noticed 
that, although these functions are utilized in the insert procedure and we suggested 
that they could be implemented most directly by adding father and index fields to each 
node, those fields are not updated by the insertion algorithm. Let us examine how this 
update could be achieved and why we chose to omit that operation. We then examine 
alternative methods of implementing the two functions that do not require the update. 

father and index fields would have to be updated each time that copy or insnode 
were called. In the case of copy, both fields in each son whose pointer is copied must be 
modified. In the case of insnode, the index field of each son whose pointer is moved must 
be modified, as well as both fields in the son being inserted. (In addition, the fields must 
be updated in the two halves of a root node being split in the insert routine.) However, 
this would impact the efficiency of the insertion algorithm in an unacceptable manner, 
especially when dealing with nodes in external storage. In the entire B-tree search and 
insertion process (excluding the update of father and index fields), at most two nodes 
at each level of the tr. are accessed. In most cases, when a split does not occur on a 
level, only one node at the level is ‘accessed. The copy and insnode operations, although 
they move nodes from one subtree to another, do so by moving pointers within one or 
two father nodes and thus do not actually require access to the son nodes being moved. 
Requiring an update to father and index fields in those sons would require accessing 
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and modifying all the son nodes themselves. But reading and writing a node from and 
to external storage are the most expensive operations in the éntire B-tree management 
process. When one considers that, in a practical information storage system, a node can 
have several hundred sons, it becomes apparent that maintaining father and index fields 
could result in a hundredfold decrease in system efficiency. 

How, then, can we obtain the father and index data required for the insertion 
process without maintaining separate fields? First, recall that the function 
nodesearch(p, key) returns the position of the smallest key in node(p) greater than 
or equal to key, so that index(nd) equals nodeseqrch( father(nd ), key). Therefore once 
father is available, index can be obtained without a separate field. , 

To understand how we can obtain father, let us look at a related problem. No B- 
tree insertion can take place without a prior search to locate the leaf where the new key 
must be inserted. This search proceeds from the root and-accesses one node at each level 
unti! it reaches the appropriate leaf. That is, it proceeds along a single path from the root 
to a leaf. The insertion then backs up along that same path, splitting all full nodes in 
the path from the leaf toward the root, until it reaches a nonfull node into which it can 
insert a key without splitting. Once that insertion is performed, the insertion process 
terminates. 

The insertion process accesses the same nodes as the search process. Since we 
have seen that accessing a node from external storage is quite expensive, it would make 
sense for the search process to store the nodes on its path together with their external 
addresses in internal memory, where the insertion process can access them without an 
expensive second read operation. But once all the nodes in a path are stored in internal 
memory, a node's father can be located by simply examining the previous node in the 
path. Thus there is no need to maintain and update a father field. 

Let us then present modified versions of find and insert to locate and insert a key 
in a B-tree. Let pathnode(i) be a copy of the ith node in the path from the root to a 
leaf, let /ocation(i) be its location (either a pointer if the tree is in internal memory, or 
an external storage address if it is in external storage), and let index(i) be the position 
of the node among the sons of its father (note that index can be determined during the 
search process and retained for use during insertion). We refer to son(i,j), k(ij) and rij) 
as the jth son, key, and record field, respectively, in pathnode(i). Similarly, numtrees(i) 
is the numtrees field in pathnode(i). 

The following find algorithm utilizes the operation access(i, loc) to copy a node 
from location loc (either from internal or external memory) into pathnode(i) and loc 
itself into /ocation(i). If the tree is stored internally, this operation consists of 


pathnede(i) = node(loc); 
location(i) = loc; 


If the tree is stored externally, the operation consists of 


directreac(loc, pathnode(i)); 
location(1) = loc; 
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where directread reads a block of storage at a particular external address (loc) into an 
internal memory buffer (pathnode(i)). We also assume that nodesearch(i, key) searches 
pathnode(i) rather than node(i). We present the algorithm find: 


null; i 


q= 
p = tree; 
J= -i 
i= -1; 
while (p != null) { 
index(++j) = 1; 
access(j, p); 
i = nodesearch(j, key); 
g=p 
if (i < numtrees(j) - 1 && key == k(j, 7)) 
break; 
p = son(j, i); 
} /* end while */ 
position = i; 
return(j); /* key is in pathnode(j) or belongs there 37 


The insertion process is modified in several places. First, insnode and copy access 
pathnode(nd) rather than node(nd). That is, nd is now an array index rather than a 
pointer, so that all references to k, son, p, and numtrees are to fields within an element 
of pathnode. The algorithms for insnode and copy do not have to be changed. 

Second, split must be modified to write out the two halves of a split node. It as- 
sumes a routine replace(i), which replaces the node at location(i) with the contents of 
pathnode(i). This routine is the reverse of access. If the tree is stored internally, it may 
be implemented by 


node(location(i)) = pathnode(i); 
and if externally, by 
directwrite(location(i), pathnode(i)); ea 


where directwrite writes a buffer in memory (pathnode(i)) into a block of external stor- 
age at a particular external address (/ocation(i)). split also uses a function makenode(i) 
that obtains a new block of storage at location x, places pathnode(i) in that block, and 
returns x. The following is a revised version of split: 


if (pos > ndiv2) { 
copy(nd, ndiv2 + 1, n - 2, nd + 1); : 
insnode(nd + 1, pos - ndiv2 - 1, newkey, newrec, newnode); 
numtrees(nd) = ndiv2 + 1; 
midkey = k(nd, ndiv2); 
midrec = r(nd, ndiv2); 
return; 

} /* end if */ 
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if (pos == ndiv2) { 
copy(nd, ndiv2, n - 2, nd + 1); 
numtrees(nd) = ndiv2; : 
son(nd + 1, 0) = newnode; 
midkey = newkey’; 
midrec = newrec; 
return; 

} /* end if */ 

if (pos < ndiv2) { ’ 
copy(nd, ndiv2, n - 2, nd + 1); 
numtrees(nd) = ndiv2; 
insnode(nd, pos, newkey, newrec, newnode); 
midkey = newkey; 
midrec = newrec; 

} /* end if */ 

replace(nd) ; 

nd2 = makenode(nd + 1); 


„Notice that nd is new a position in pathnode rather than a node pointer, and that 
pathnode(nd + 1) rather than node(nd2) is used to build the second half of the split 
node. This can be done since the node at level nd + 1 (if any) of the path has already 
been updated by the time split is called on nd, so that pathnode(nd + 1) can be reused. 
nd2 remains the actual location of the new node (allocated by makenode). (W= should 
note that it may be desirable to retain a path to the newly inserted key in pathnode if, for 
example, we wish to perform a sequential traversal or sequential insertions beginning at 
that point. In that case the algorithm must be suitably adjusted to place the appropriate ' 
left or right half of the split node at the appropriate position in pathnode. We also could 
then not use pathnode(i + 1) to build the right half but must use an additional auxiliary 
internal memory node instead. We leave the details to the reader.) 

The routine insert itself is also modified in that it uses nd — 1 rather than 
father(nd). It also calls upon replace and makenode. When the root must be split, 
maketree builds a new tree root node in internal storage. This node is placed in 
patknode(i) (which is no longer needed, since the old root node has been updated 


by split) and written out using makenode. The following is the revised algorithm for 
insert. i 


nd = s; 

pos = position; 

newnode = null; 

newrec = rec; 

newkey = key; 

while (nd != 0 & numtrees(nd) == n) { 
split(nd, pos, newkey, newrec, newnode, nd2, midkey, midrec); 
newnode = nd2; 
pos = index(nd); 
nd--; 
newkey = midkey; 
newrec = midrec; 

} /* end while */ 
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if (numtrees(nd) < n) { 
insnode(nd, pos, newkey, newrec, newnode) ; 
replace(nd) ; 
return; ~ 
} /* end if */ F 
split(nd, pos, newkey, newrec, newnode, nd?, midkey, midrec); 
pathnode(0) = maketree(midkey, midrec); 
son(0, 0) = nd; 
son(0, 1) = nd; 
tree = makenode(0); 


Deletion in Multiway Search Trees 


The simplest method for deleting a record froma multiway search tree is to retain 
the key in the tree but mark it in some way as representing a deleted record. This could 
be accomplished by setting the pointer to the record corresponding to the key to null 
or by allocating an extra flag field for each key to a ficate whether or not it has been 
deleted. The space occupied by the record itself can, of course, be reclaimed. In this 
way the key remains in the tree as a guidepost to the subtrees but does not represent a 
record within the file. Ñ 

The disadvantage of this approach is that the space occupied by the key itself 
is wasted, possibly leading to unnecessary nodes in the tree when a large number of 
records have been deleted. Extra “deleted” bits require still more space. 

Of course, if a record with a deleted key is subsequently inserted, the space for the 
key can’be recycled. in a nonieaf node, only the same key would be able to reuse the 
space, since it is too difficult to determine dynamically that the newly inserted key is 
between the predecessor and successor of the deleted key. However, in a leaf node (or, in 
certain situations, in a semileaf), the deleted key’s space can be reused by a nei ghboring 
key, since it is relatively easy to determine proximity. Since a large portion of keys are in 
leaves or semileaves, if insertions and deletions occur with equal frequency (or if there 
are more insertions than deletions) and are uniformly distributed (that is, the deletions are 
not bunched together to significantly reduce the total number of keys in the tree temporar- 
ily), the space penalty is tolerable in exchange for the advantage of ease of deletion. There 
is also a small time penalty in subsequent searches, since some keys will require more 
nodes to be examined than if the deleted key had never been inserted in the first piace. 
If we are unwilling to pay the space/search-time penalty of simplified deletion, 
there are more expensive deletion techniques that eliminate the penalty. In an unre- 
stricted multiway search tree, a technique similar to deletion from a binary search tree 
can be employed: 

1. If the-key to be deleted has an empty left or right subtree, simply delete the key 
and compact the node. If it was the only key in the node, free the node. 

2. If the key to be deleted has nonempty left and right subtrees, find its successor 
key (which must have an empty left subtree); let the successor key take its place 
and compact the node that had contained that successor. If the successor was the 
only key in the node, free the node. 


We leave the development of a detailed algorithm and program to the reader. 
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However, this procedure may result in a tree that does not satisfy the requirements 
for either a top-down tree or a B-tree, even if the initial tree did satisfy those require- 
ments. In a top-down tree, if the key being deleted is from a semileaf that is not a leaf 
and the key has empty right and left subtrees, the semiledf will be left with fewer than 
n — 1 keys, even though it is not a leaf. This violates the top-down requirement. ‘In 
that case it is necessary to choose a random nonempty subtree of the node and move 
the largest or smallest key from that subtree into the semileaf from which the key was 
deleted. This process must be repeated until the semileaf from which a key is taken is 
a leaf. This leaf can then be compacted or freed. In the worst case, this might require 
rewriting one node at each level of the tree. 

In a strict B-tree, we must preserve the requirement that each node contains at 
least (n — 1)/2 keys. As noted previously, if a key is being deleted from a nonleaf node, 
its successor (which must be in a leaf) is moved to the deleted position and the deletion 
proceeds as if the successor were deleted from the leaf node. When a key (either the key 
to be deleted or its successor) is removed from a leaf node and the number of keys in 
the node drops below (n.— 1)/2, remedial action must be taken. This situation is called 
an underflow. When an underflow occurs, the simplest solution is to examine the leaf’s 
younger or older brother. If the brother contains more than (1 — 1)/2 keys, the key ks in 
the father node that separates between the two brothers can be added to the underflow 
node and the last or first key of the brother (last if the brother is older; first if younger) 
added to the father in place of ks. Figure 7.3.9a illustrates this process on an order-5 
B-tree. (We should note that once a brother is being accessed, we could distribute the 
keys evenly between the two brothers rather than simply. shifting one key. For example, 
if the underflow node n1 contains 106 and 112, the separating key in the father fis 120, 
and the brother n2‘contains 123, 128, 134, 139, 142, and 146 in an order-7 B-tree, we 
can rearrange them so that n1 contains 106, 112, 120, and 123, 128 moves up to fas the 
separator, and 134, 139, 142, and 146 remain in 2.) 

If both brothers contain exactly (n — 1)/2 keys, no keys can be shifted. In that 
case the underflow node and one of its brothers are concatenated, or consolidated, into 
a single node that also contains the separator key from their father. This is illustrated in 
Figure 7.3.9b, where we combine the underflow node with its younger brother. 

Of course, it is possible that the father contains only (n — 1)/2 keys, so that it also 
has no extra key to spare. In that case it can borrow from its father and brother as in 
Figure 7.3.10a. In the worst case, when the father’s brothers also have no spare keys, 
the father and its brother may also be consolidated and a key taken from the grandfather. 
This is illustrated in Figure 7.3.10b. Potentially, if all nonroot ancestors of a node and 
their brothers contain exactly (n — 1)/2 keys, a key will be taken from the root, as 
consolidations take place at each level from the leaves to the level just below the root. 
If the root had more than‘one key, this ends the process, since the root of a B-tree need 
have only one key. If, however, the foot contained only a single key, that key is used in 
the consolidation of the two nodes below the root, the root is freed, the consolidated node 
becomes the new root of the tree, and the depth of the B-tree is reduced. We leave to the 
reader the development of an actual algorithm for B-tree deletion from this description. 

You should note, however, that it is foolish to form.a consolidated node with n — | 
keys if a Subsequent insertion will immediately split. the node in two. In a B-tree of large 
order, it may make sense to leave an underflow node with fewer than (n — 1)/2 keys 
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(even though this violates the formal B-tree requirements) so that future insertions can 
take place without splitting. Typically, a minimum number (min less than (n — 1)/2) of 


keys is defined so that consolidation takes place only if fewer than min keys remain in 
a leaf node. i 


Efficiericy of Multiway Search Trees 


The primary consi. “rations in evaluating the efficiency of multiway search trees, 
as for all data structures, are time and space. Time is measured by the number of nodes 
accessed or modified in an operati.:n, rather than by the number of key comparisons. The 
reason for this, as mentioned earlier, is that accessing a node usually involves reading 
from external storage and modifying a node involves writing to external storage. These 
operations are far more time-consuming than internal memory operations and therefore 
dominate the time required. 

Similarly, space is measured by the number of nodes in the tree and the size of 
the nodes rather than by the number of keys actually contained in the nodes, since 
the same space is allocated for a node regardless of the number of keys it actually 
contains. Of course, if the records themselves are stored outside the tree nodes, the 
Space requirement for the records is determined by how the record storage is organized 
rather than by hew the tree itself is organized. The storage requirements for the records 
generally overwhelm the requirements for the key tree, so that the actual tree space may 
not be significant. 

First, let us examine top-down multiway search trees. Assuming an order-m tree 
and n records, there are two extreme possibilities. In the worst case for search time, 
the tree is totally unbalanced. Every node except one is a semileaf with one son and 
contains m — 1 keys. The single leaf node contains ((n — 1) % (m — 1)) + 1 keys. The 
tree contains ((n — 1)/(m — 1)) + 1 nodes, one on each level. A search or an insertion 
accesses half that many nodes on the average and every node in the worst case. An 
insertion also requires writing one or two nodes (one if the key is inserted in the leaf, 
two if a néw leaf must be created). A deletion always accesses every node and can 
modify as few as one node, but can potentially modify every node (unless a key can be 
simply marked as deleted). ie 

In the best case for search time, the tree is almost balanced, each node except 
one contains m — 1 keys, and each nonleaf except one has m sons. There are stil] 
((n — 1)/(@m — 1)) + 1 nodes, but there are fewer than log» (n — 1) + 1 levels. Thus 
the number of nodes accessed in a search. insertion, or deletion is less than this num- 
ber. (In such a'tree, more than half the keys are in a semileaf or leaf, so that the average 
search time is not much better than the maximum.) Fortunately, as is the case for binary 
trees, fairly balanced trees occur far more frequently than unbalanced trees, so that the 
average search time using multiway search trees is O(log n). 

However, a general multiway tree and even a top-down multiway tree use an 
inordinate amount of storage. To see why this is so, see Figure 7.3.11. which illustrates 
a typical top-down multiway search tree of order |] with 100 keys. The tree is fairly 
balanced and average search cost is approximately 2.19 [10 keys at level 0 require 
accessing one node. 61 at level | require accessing two nodes. and 29 at level 2 re- 
quire accessing three nodes: (10* 1 +6] «2+ 29*3) 100 = 2.19], which is reasonable. 
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However, to accommodate the 100 keys, the tree uses 23 nodes or 4.35 keys per node, 
representing a space utilization of only 43.5 percent. The reason for this is that many 
leaves contain only one or two keys, and the vast majority of the nodes are leaves. As the 
order and the number of keys increase, the utilization becomes worse, so that an order- 
11 tree with thousands of keys can expect 27 percent utilization, and an order-21 tree 
can expect only 17 percent utilization. As the order grows even larger, the utilization 
drops toward 0. Since high orders are required to produce reasonable search costs for 
large numbers of keys, top-down multiway-trees are an unreasonable alternative for 
data storage, $ 

Every B-tree is balanced and each node contains at least (m — 1)/2 keys. Figure 
7.3.12 illustrates the minimum «3d maximum number of nodes and keys at levels 0, 
1, 2, and an arbitrary level i, as well as the minimum and. maximum number of total 
nodes and keys in a B-tree of order m and maximum level d. In that figure, q equals 
(m — 1)/2. Note that the maximum level is 1 less than the number of levels (since the 
root is at level 0), so that + 1 equals the maximum number of node accesses needed to 
find an element. From the minimum total number of keys in Figure 7.3.12, we can de- 
duce that the maximum number of node accesses for one of n keys in an order-m B-tree 
is 1 + logg+; (n/2). Thus, unlike top-down multiway trees, the maximum number of 
node accesses grows only logarithmically as the number of keys. Nevertheless, as we - 
pointed out earlier, average search time is competitive between top-down multiway 
trees and B-trees, since top-down trees are usually fairly balanced. ” 

Insertion into a B-tree requires reading one node per level and writing one node 
at minimum plus two nodes for every split that occurs. If s splits occur, 2s + 1 nodes 
are written (two halves of each split plus the father of the last node- split). Deletion 
requires reading one node per level to find a leaf, writing one node if the deleted key 
is in a leaf and the deletion does not cause an underflow, and writing two nodes if the 
deleted key is in a nonleaf and removing the replacement key from a leaf does not cause 
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that leaf to underflow. If an underflow does occur, one additional read (of the brother 
of each underflowed node) per underflow, one additional write for every consolidation 
except the Jast, and three additional writes for the final ynderflow if no consolidation is 
necessary (the underflow node, its brother, and its father) or two additional writes if a 
consolidation is necessary (the consolidated node and its father) are required. All these 
operations are O(log n). 

As is the case with a heap (Section 6.3) and a balanced binary tree (Section 7.2), 
insertion and deletion of the minimum or maximum element are both O(log n) in a 
B-trec; therefore the structure can be used to implement an (ascending or descending) 
priority queue efficiently. In fact, a 3-2 tree (a B-tree of order 3) is probably the most 
efficient practical method for implementing 1 priority queue in internal memory. 

Since each node in a B-tree (except the root) must be at least approximately half 
full, the worst case storage utilization approaches 50 percent. In practice, average stor- 
age utilization in a B-tree approaches 69 percent. For example, Figure 7.3.13 illustrates 
a B-tree of order 11 with the same 100 keys as the multiway tree of Figure 7.3.11. Av- 
erage search time is 2.88 (1 key requiring 1 node access, 10 keys requiring 2 accesses, 
and 89 keys requiring 2 accesses), which is greater than the corresponding multiway 
tree. Indeed, a fairly balanced top-down multiway tree will have lower search cost than 
a B-tree, since all its upper nodes are always completely full. However, the B-tree con- 
tains only 15 nodes, yielding a storage utilization of 66.7 percent, far higher than the 
43.5 percent utilization of the multiway tree. 


. Improving the B-Tree 


There are a number of ways of improving the storage utilization of a B-tree. One 
method is to delay splitting a node when it overflows. Instead, the keys in the node and 
one of its adjacent brothers, as well as the key in the father that separates between the 
two nodes, are redistributed evenly. This is illustrated in Figure 7.3.14 on a B-tree of 
order 7. When both a node and its. brother are full, the two nodes are split into 3. This 
guarantees a minimum storage utilization of almost 67 percent, and the storage utiliza- 
tion is higher in actual practice. Such a tree is called a B*-tree. Indeed, this technique 
can be extended even further by redistributing keys among all the brothers and the father 
of a full node. Unfortunately, this method exacts its own price, since it requires expen- 
sive additional accesses upon overflow insertions, while the marginal additional space 
utilization achieved by considering each extra brother becomes smaller and smaller. 

Another technique is to use a compact B-tree. Such a B-tree has maximum stor- 
age utilization for a given order and number of keys. It can be shown that this maximum 
storage utilization for a B-tree of a given order and a given number of keys is attained 
when nodes toward the bottom of the tree contain as many keys as possible. Figure 
7.3.15 illustrates a compact B-tree for the 100 keys of Figures 7.3.11 and 7.3.13. It 
can be shown that the average search-cost for a compact B-tree is never more than | 
more than the minimum average search cost among all B-trees with the given order 
and number of keys. Thus, although a compact B-tree achieves a maximum storage 
utilization, it also achieves reasonable search cost. For example, the search cost for the 
tree of Figure 7.3. 14 is only 1.91 (9 keys at level 0, requiring one access, and 91 keys 
at level 1, requiring two accesses: 9*1+91*2 = 191/100 = 1.91), which is very close 
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(b) After insertion of 35 and redistribution of brothers, 


10 15 20 30 35 40-~ 60 80 100 110 120 130 


(c) After insertion of 15 and 80. 


100 110 120 130 


(d) After insertion of 90 ahd redistribution of father and brouie=. 


Figure 7.3.14 B-tree of Order 7 with redistribution of multiple nodes. 
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to optimal. Yet the tree uses only 1] nodes, for a storage utilization of 90.9 percent. 
With more keys, Storage utilization in compact B-trees reaches 98 percent or even 99 
percent. cae 

Unfortunately, there is no known efficient algorithm to insert a key into a compact 
B-tree and maintain compactness. Instead, insertion proceeds as in an ordinary B-tree 
and compactness is not retained. Periodically (for example, at night when the file is 
not used), a compact B-tree can be constructed from the noncompact tree. However, 
a compact B-tree degrades so rapidly with insertions that, for high orders, storage uti- 
lization drops below that of a random B-tree after fewer than 2 percent additional keys 
have been inserted. Also, the number of splits required for an insertion is higher on the 
average than for a random B-tree. Thus. a compact B-tree should only be used when 
the set of keys is highly stable. : 


For example, if three keys are anchor, andrew, and antoin, andrew can be encoded 
as 2d, indicating that the first two characters are the same as its predecessor and that the 
next character, d, distinguishes it from its predecessor and successor. If the successors 
of andrew within the node were andule, antoin, append, and apples, andrew would be 
encoded as 2d, andule as 3u, antoin as simply 2, and append as Ippe. 

If rear compression is used, it is necessary to access the record itself to determine if 
a key is present in a file, since the entire key cannot be reconstructed from the encoding. 
Also, under both methods, the key code that is retained is of variable length, so that 
the maximum number of keys in a node is no longer fixed. Another disadvantage of 
variable-length key encoding is that binary search can no longer be used to locate a key 
in a node. In addition, the key code for some existing keys may have to be changed 
when a new key is inserted. The advantage of compression is that it enables more keys 


to be retained in a node, so that the depth of the tree and the number of nodes required 
can be reduced. 


B + -Trees 


One of the major drawbacks of the B-tree is the difficulty v; aversing the keys 
Sequentially. A variation of the basic B-tree Structure, the B*-tree, retains the rapid 
random access Property of the B-tree, whi'e also allowing rapid sequential access. In 
the B*-tree, all keys are maintained in leaves, and keys are replicated in nonleaf nodes 
to define paths for locating individual records. The leaves are linked together to provide 
a sequential path for traversing the keys in the tree. 
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Figure 7.3.16 illustrates a B*-tree. To locate the record associated with key 53 
(random access), the key is first compared with 98 (the first key in the root). Since it 
is less, proceed to node B. Fifty-three is then compared with 36 and then 53 in node B. 
Since it is less than or equal to 53, proceed to node E. 

Note that the search does not halt when the key is found as is the case in a B- 
tree. In a B-tree, a pointer to the record corresponding to a key is contained with each 
key in the tree, whether in a leaf or'a nonleaf node. Thus once the key is found, the 
record can be accessed. In a B*-tree, pointers to records are only associated with keys 
in leaf nodes; consequently the search is not complete until the key is located in a leaf. 
Therefore, when equality is obtained in a nonleaf, the search continues. In node E (a 
leaf), key 53 is located, and from it, the record associated with that key. If we now wish 
to traverse the keys in the tree sequentially beginning with key 53, we need only follow 
the pointers in the leaf nodes. 

The linked list of leaves is called a Sequence set. In actual implementations, the 
nodes of the sequence set frequently do not contain all the keys in the file. Rather, each 
Sequence set node serves as an index to a large data area where a large number of records 
are kept. A search involves traversing a path in the B* -tree, reading a block from the 
data area associated with the leaf node that is finally accessed, and then searching the’ 
block sequentially for the required record. 

The B* -tree may be considered to be a natural extension of the indexed sequential 
file of Section 7.1. Each level of the tree is an index to the succeeding level, and the 
lowest level, the sequence set, is an index to the file itself. 

Insertion into a B* -tree proceeds much as in a B-tree except that when a node is 
split, the middle key is retained in the left half-node as well as being promoted to the 
father. When a key is deleted from a leaf. it can be retained in the nonleaves, since it is 
still a valid separator between the keys in the nodes below. 

The B*-tree retains the search and insertion efficiencies of the B-tree but in- 
creases the efficiency of finding-the next record in the tree from O(log n) (in a B-tree, 
where finding the successor involves climbing up or down the tree) to O(1) (in a B+- 
tree, where it involves accessing one additional leaf at most). An additional advantage 
of the B*-tree is that no record pointers need be kept in the nonleaf nodes, which in- 
creases the potential order of the tree. 


Digital Search Trees 


Another method of using trees to expedite searching is to form a general tree 
based on the symbols of which the keys are composed. For example, if the keys are 
integers, each digit position determines one of ten possible sons of a given node. A forest 
representing one such set of keys is illustrated in Figure 7.3.17. If the keys consist of 
alphabetic characters, each letter of the alphabet determines a branch in the tree. Note 
that every leaf node contains the special symbol eok, which represents the end of a key. 
Such a leaf node must also contain a pointer to the record that is being stored. 

If a forest is represented by a binary tree, as in Section 5.5, each node of the binary 
tree contains three fields: symbol, which contains a symbol of the key; son, which is a 
pointer to the node’s oldest son in the original tree: aoa rig which is a pointer to 
the node's next younger brother in the original tree. The first tree in the forest is pointed 
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Figure 7.3.17 Forest representing a table of keys.” 


to by an external pointer tree, and the roots of the other trees in the forest are linked 
together in a linear list by the brother field. The son field of a leaf in the original forest 
points to a record; the concatenation of all the symbols in the original forest in the path 
of nodes from root to the leaf is the key of the record. We make two further stipulations 
that speed up the search and insertion process for such a tree: each list of brothers is 
arranged in the binary tree in ascending order of the symbol field, and the symbol eok 
is considered to be larger than any other. 

Using this binary tree representation, we may present an algorithm to search and 
insert into such a nonempty digital tree. As usual, key is the key for which we are 
searching, and rec is the record that we wish to insert if key is not found. We also let 
key(i) be the ith symbol of the key. If the key has n symbols, we assume that key(n) 
equals eok. The algorithm uses the getnode operation to allocate a new tree node when 
necéssary. We assume that recptr is a pointer to the record rec to be inserted. The algo- 
rithm returns a pointer to the record that is being sought and uses an auxiliary function 
insert, whose algorithm is also given. 


p*= tree; 
father = null; /* father is the father of p */ 
for (i = 0;; i++) { 
q= null; /* q points to the other brother of p */ 
while (p != null && symbol(p) < key(i)) { 
q= p; 
p = brother(p); 
} /* end while */ 
if (p == null || symbol(p) > key(i)) { 
insval = insert(i, p); 
return(insval); 
} /* end if */ 
if (key(i) == eok) 
return(son(p)); 
else { 
father = p; 
p = son(p); 
. | /* end else */ 
} /* end for */ 


The algorithm for insert is as follows: 


/* insert the ith symbol of the key */ 
s = getnode(); 
symbol(s) = key(i); 
brother(s) = p; 
if (tree == null) 
tree = s; 
else 
if (q != null) 
brother(q) = s; 
else 
(father == null) ? tree = s : son(father) = s$; 
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/* insert the remaining symbols of the key */ 
for (j = i; key(j) != eok; j++) { 
father = s; 
S = getnode(); 
symbol(s) = key(j + 1); 
son(father) = s: 
brotner(s) = nul); 
} /* end for */ 
son(s) = addr(rec); 
return(son(s)); 


Note that by keeping the table of keys as a general tree, we need search only a 
small list of sons to find whether a given symbol appears at a given position within the 
keys of the table. However, it is possible to make the tree even smalier by eliminating 
those nodes from which only a single leaf can t reached, For example, in the keys of 
Figure 7.3.17, once the symbol ‘7° is recognized. the only key that can possibly match 
is 768. Similarly, upon recognizing the two symbols `i and *9°. the only matching key 
is 195. Thus the forest of Figure 7.3.17 can be abbreviated to ihe one of Figure 7.3.18. 
In that figure a box indicates a key and a circle indicates a tree node. A dashed line is 
used to indicate a pointer from a tree node to a key. 

There are some significant differences between the trees of Figures 7.3.17 and 
7.3.18. In Figure 7.3.17. a path from a root to a leaf represents an entire key; thus there 
is no need to repeat the key itself. In Figure 7.3.18. however, a key may be recognized 
only by its first few symbols. In those cases in which the search is made for a key 
that is known to be in the table. upon finding a leaf the record corresponding to that 
key car be accessed. If, however. as is more likely, it is not known whether the key 
is present in the table. it must be confirmed that the key is indeed correct. Thus the 
entire key must be kept.in the record as well. Furthermore. a leaf node in the tree of 
Figure 7.3.17 can be recognized, because its contents are eok. Thus its son pointer can 
be used to point to the record that that leaf represents. However. a leaf node of Figure 
7.3.18 may contain any symbol. Thus. to use the son pointer of u leaf to point to the 
record, an extra field is required in each node to indicate whether or not the node is a 
leaf. We leave the representation of the forest of Figure 7.3.18 anu Jie implementation 
of a search-and-insert algorithm for it as an exercise for the reader. 

The binary tree representation of a digital search tree is efficient when each node 
has relatively few sons. For example. in Figure 7.3.18 only one node has as many as six 
(out of a possible ten) sons. whereas most nodes have only one. two. or three sons. Thus. 
the process of searching through the list of sons to match the next symbol in the key 
is relatively efticient. However. if the set of keys is dense within the set of all possible 
keys (that is, if almost any possible combination of symbols actually appears as a key), 
most nodes will have a large number of sons, and the cost of the search process becomes 
prohibitive. 


Tries 


A digital search tree need not be implemented as a binary tree. Instead. each 
node in the tree can contain m pointers, cos pending to the m possible syinbols in 
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Figure 7.3.18 Condensed forest represenung a table of keys. 


each position of the key. Thus, if the keys were numeric, there would be 10 pointers 
in a node, and if strictly alphabetic, there would be 26. (There might also be an extra 
pointer corresponding to eok, or a flag with each pointer indicating that it pointed to a 
record rather than to a tree node.) A pointer in a node is associated with a particular 
symbol value based on its position in the node; that is, the first pointer corresponds to 
the lowest symbol value, the second pointer to the second Jowest, and so forth. It is 
therefore unnecessary to keep the symbol values themselves in the tree. The number 
of nodes that must be accessed to find a particular key is log mn. A digital search tree 
implemented in this way is called a trie (from the word retrieval). 

A trie is useful when the set of keys is dense, so that most of the pointers in each 
node are used, When the key set is Sparse, a trie wastes a large amount of space with 
large nodes that are mostly empty. If the set of keys in a trie is known in advance and 
does not change, there are a number of techniques for minimizing the space require- 
ments. One technique is to establish’a different order in which the symbols of a key are 
used for searching (so that, for example, the third symbol of the argument key might be 
used to access the appropriate pointer in the trie 5 ot, the first symbol in Jevel 1 nodes, 
and so forth). Another technique is to allow trie nudes to overlap each other, so that 
occupied pointers of one node overlay empty pointers of another, 


EXERCISES 


7.3.1. Show how a B-tree and a B*-tree can be used to implement a priority queue (see 
Sections 4.1 and 6.3). Show that any sequence of n insertion and minimum-deletion 
operations can be performed in O(n log n) steps. Write C routines to insert and delete 
from a priority queue implemented by a 2-3 tree. 

7.3.2. Choose any large paragraph from a book. Insert each word of the paragraph, in order, 
into an initially empty top-down multiway search tree of order 5. omitting any dupli- 
cates. Do the same for a B-tree of order 5, a B~-tree of order 5, and a digital search 
tree, 

7.3.3. Write C routines to implement the B-tree successor insertion operations if the B-tree 
is maintained 
(a) In internal memory 
(b) In external direct access storage 


7.3.4. Write an algorithm and a C routine to delete a record from a top-down multiway search 
tree of order n. 


7.3.5. Write an algorithm and a C routine to delete a record from a B-tree of order n. 
7.3.6. Write an algorithm to create a compact B-tree from input in sorted order. Use the 

algorithm to write a C routine to produce a compact B-tree from an ordinary B-tree. 
7.3.7. Write an algorithm and a C routine to search in a B-tree. 
7.3.8. Write an algorithm and a C routine to 

(a) Insert into a B*-tree 

(b) Insert into a B’-tree 

({c) Delete from a B*-tree 

(d) Delete from a B’-tree 
7.3.9. How many different 2-3 trees containing the integers | through 10 can you construct? 


How many permutations of these integers result in each tree if they are inserted into 
an initially empty tree in permutation order? 
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7.3.10. Develop algorithms to search and insert into a B-tree that uses front and rear compres- 
sion. 


7.3.11. Write a search-and-insert algorithm and C routine for the digital search forest of 
Figure 7.3.18. 


7.3.12. Show how to implement a trie in external storage. Write a C search-and-insert routine 
z for a trie. 


7.4 HASRING 


in the preceding two sections we assumed that the record being sought is stored in a 
table and that it is necessary to pass through some number of keys before finding the 
desired one. The organization of the filè (sequential, indexed sequential, binary tree, 
and so forth) and the order in which the keys are inserted affect the number of keys 
that must be inspected before obtaining the desired one. Obviously. the efficient search 
techniques are those that minimize the number of these compariscns. Optimally, we 
would like to have a table organization and search technique in which there are no 
unnecessary comparisons. Let us see if this is feasible. 

If each key is to be retrieved in a single access, the location of the record within 
the table can depend only on the key: it may not depend on the locations of other keys, 
as in a tree. The most efficient way to organize such a table is as an array (that is, each 
record is stored at a specific offset from the base address of the table). If the record keys 
are integers, the keys themselves can serve as indices to the array. 

Let us consider an example of such a system. Suppose that a manufacturing com- 
pany has an.inventory file consisting of 100 parts, each part having a unique two-digit 
part number. Then the obvious way to store this file is to declare an array 


parttype part[100j; 


where part{i] represents the record whose part number is i. In this situation, the part 
numbers are keys that are used as indices to the array. Even if the company stocks tewer 
than 100 parts, the same structure can be used to maintain the inventory file. Although 
many locations in part may correspond to nonexistent keys, this waste is offset by the 
advantage of direct access to each of the existent parts. 

Univrtunately, however. such a system is not always practical. For example, sup- 
pose that the company has an inventory file of more than 100 items and the key to each 
record is a seven-digit part number. To use direct indexing using the entire seven-digit 
key. an array of 10 million elements would be required. This clearly wastes an unac- 
ceptably large amount of space because it is extremely uulikely that a company stocks 
more than a few thousand parts. 

What is necessary is some method of converting a key 7> 7n integer within a 
limited range. Ideally. no two keys should be converted into the same integer. Unfortu- 
nately, such an ideal method usually does not exist. Let us attempt to develop methods 


that come close to the ideal, and determine what action to take when the ideal is not 
achieved. 
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Let us reconsider the example of a company with an inventory file in which each 
record is keyed by a seven-digit part number. Suppose that the company has fewer 
than 1000 parts and that there is only a single record for each part. Then an array of 
1000 elements is sufficient to contain the entire file. The array is indexed by an integer 
between 0 and 999 inclusive. The last three digits of the part number are used as the 
index for the part’s record in the array. This is illustrated in Figure 7.4.1. Note that two 
keys that are relativeiy close to each other numericaliy, such as 4618396 and 4618996, 
may be farther from each other in the table than two keys that are widely separated 
numerically, such as 0000991 and 9846995, Only the last three digits of the key ure 
used in determining the position of a record. 

A function that transforms a key into a table index is called a hash function. If 
his a hash function and key is a key, A(key) is called the hash of key and is the index 
at which a record with the key key should be placed. If ris u record whose key hashes 
into Ar, hr is called the hash key of r. The hash function in the preceding example is 
h(k) = kev % 1000. The values that h produces should cover the entire set of indices in 
the table. For example. the function x % 1000 can produce any integer between ( and 
999, depending on the value of x. As we shall see shortly, it is a good idea for the tuble 
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size to be somewhat larger than the number of records that are to be inserted. This is 
illustrated in Figure 7.4.1, where several positions in the table are unused. 

The foregoing method has one flaw. Suppose that two keys k] and k2 are such 
that A(k1) equals h(k2). Then when a record with key k1 is entered into the table, it is 
inserted at position h(k1). But when k2 is hashed, because its hash key is the same as 
that of k2, an attempt may be made to insert the record into the same position where 
the record with key k1 is stored. Clearly, two records cannot occupy the same position. 
Such a situation is called a hash collision or a hash clash. A hash clash occurs in the 
inventory exampie of Figure 7 4.1 if a record with key 0596397 is added to the table. 

There are two basic metnods of dealing with a hash clash. We explore both in 
detail in the remainder of this section. Briefly, the first technique, called rehashing, in- 
volves using a secondary hash function on the hash key of the item. The rehash function 
is applied successively until an empty position is found where the item can be inserted. 
If the hash position of the item is found to be occupied during a search, the rehash func- 
tion is again used to locate the item. The second technique, called chaining, builds a 
linked list of all items whose keys hash to the same values. During search, this short 
linked, list is traversed sequentially for the desired key. This technique involves adding 
an extra link field to each t? : position. 

However, it should ~ noted that a good hash function is one that minimizes colli- 
sions and spreads the records uniformly throughout the table. That is why it is desirable 
to have the array size larger than the numbe: of actual records. The larger the range of 
the hash function, the less likely it is that two keys yield the same hash value. Of course, 
this involves a space/time trade-off. Leaving empty spaces in thi + irray is inefficient in 
terms of space; but it reduces the necessity of resolving hash clashes and is therefore 
more efficient in terms of time. 

Although hashing allows direct access to a table and is therefore peeferalile't to 
other search techniques, the method has one serious flaw. Items in a hash table are not 
stored sequenually by key, nor is there any generally practical method for traversing the 
items in key sequence. Order-preserving hash functions, in which h(key1) > h(key2) 
whenever keyl > key2, are usually nonuniform; that is, they do not minimize hash 


collisions and so do not serve the basic purpose of hashing: rapid access to any record 
directly from its key. 


Resolving Hash Clashes by Open Addressing 


Let us consider what would happen if it were desired to enter a new part num- 
ber 0596397 into the table of Figure 7.4.1. Using the hash function key % 1000, 
#(0596397) = 397; therefore the record for that part belongs in position 397 of the 
array. However. position 397 is already occupied by the recotd with key 4957397. 
Therefore the record with key 0596397 must be inserted els ‘where in the table. 

The simplest method of resolving hash clashes is to place ‘>e record in the next 
available position in the array. In Figure 7.4.1, for example, since position 397 is already 
occupied. the record with key 0596397 is placed in location 398, which is still open. 
Once that record has been inserted, another record that hashes to either 397 (such as 


8764397) or 398 (such as 2194398) is inserted at the next available position, which 
is 400. 
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This technique is called linear probing and is an example of a general method 
for resolving hash clashes called rehashing or open addressing. In general, a rehash 
function, rh, accepts one array index and Produces another. If array location h(key) is 
already occupied by a record with a different key, rh is applied to the value of h(key) 
to find another location where the record may be placed. If position rh(h(key)) is also 
occupied, it too is rehashed to see if rh(rh(h(key))) is available. This process continues 
until an empty location is found. Thus we may write a search and insertion function 
using hashing as follows. We assume the following declarations: 


#define TABLESIZE... 

typedef KEYTYPE ... 

typedef RECTYPE ... 

struct record { 
KEYTYPE k; 
RECTYPE r; 

} table[TABLESIZE]; 


We also assume a hash function A(key) and a rehash function rh(i). The special value 
nullkey is used to indicate an empty record. 


int search(KEYTYPE key, RECTYPE rec) 
{ 


‘int i; 

i = h(key); /* hash the key */ 

while (table[i].k != key && table[i].k != nul lkey) 
i= rh(i); /* rehash */ 

if (table[i].k == nullkey) { 
/* insert the record into the empty position */ 
table[i].k = key; 
table[i].r = rec; 

} /* end if */ 

return(i); 

} /* end search */ 


In the example of Figure 7.4.1, h(key) is the function key % 1000, and rh(i) is the 
function (i + 1) % 1000 (that is, the rehash of any index is the next sequential position 
in the array, except that the rehash of 999 is 0). 

Let us examine the algorithm more closely to see if we can determine the prop- 
erties of a “good” rehash function. In particular, we focus our attention on the loop. 
because the number of iterations determines the efficiency of the search. The loop can 
be exited in one of two ways: either j is set to a value such that table[i].k equals Key (in 
which case the record is found). or i is set to a value such that table[i}.k equals nullkey 
(in which case an empty position is found and the record may be inserted). 

It may happen, however, that-the loop executes forever. There are two possible 
reasons for this. First, the table may be full, so that it is impossible to insert any new 
records. This situation can be detected by keeping a count of the number of records in 
the table. When the count equals the table size, no additional insertions are attempted. 


A 
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However, it is possible for the algorithm to loop indefinitely even if there 
are some (or even many) empty positions. Suppose, for example, that the function 
rh(i) = (i'+ 2) % 1000 is used as a rehash function. Then any key that hashes into an 
odd integer rchashes into successive odd integers. and any key that hashes into an even 
integer rehashes into successive even integers. Consider the situation in which all the 
odd positions of the table are occupied and all the even ones are empty. Despite the 
fact that half the positions of the array are empty, it is impossible to insert a new record 
whose key hashes into an odd integer. Of course, it is unlikely that all the odd positions 
are occupied, while none of the even positions are. However, if the rehash function 
rh(i) = (i + 200) % 1000 is used. each key can be placed in only one of five positions 
[since x % 1000 = (x + 1000) % 1000). and it is quite possible for these five _ to 
be full while much of the table is empty. 

One property of a good rehash function is that for any index i, the successive 
rchashes rh(i), rh(rh(i)), ... cover as many of the integers between 0 and tablesize — | 
as possible (ideally. all of them). The rehash function rath = (i + 1) % 1000 has this 
property. In fact, any function rh(i) = (i + c) % tabiesize. where c is a constant value 
such that c and tablesize are relatively prime (that is. they cannot both be divided evenly 
by a sifigle integer other than !), produces successive values that cover the entire table. 
You are invited to confirm this fact by choosing some examples; the proof is left as an 
exercise. In general, however, there is ro reason to choose a value of C other than 1. If 
the hash table is stored in external storage, it is desirable to have successive references 
as Close to each other as possible (this minimizes seek delay on disks and may eliminate 
an //O if the two references are on the same page). 

There is another measure of the suitability of a rehash function. Consider 
the case of the linear rehash. Assuming that the hash function produces indices 
that are uniformly distributed over the interval O through tublesize — 1 [that is, 
it is equally likely that (key) is any particular integer in that range}. then initially, 
when the entire array is empty, it is equally likely that a random record will be 
placed at any given (empty) position within the array. However, once entries have 
been inserted and several hash clashes have been resolved. this is no longer true. For 
example. in Figure 7.4.1 it is five times as likely for a record to be inserted at position 
994 than at position 401. This is because any record whose key hashes into 990, 
991, 992, 993, or 994 will be placed in 994, whereas only a record whose key hashes 
into 401 will be placed in that location. This phenomenon, where two keys that hash 
into different values compete with each other in successive rehashes, is called primary 
clustering. 

The same phenomenon occurs in the case of the- van function riti) = (i + c) 
% tablesize. For example, if tablesize = 1000. ¢ = 21, and positions 10, 31, 52, 73, 
and 94 are all occupied, any record whose key is any one of these five integers will be 
placed at location 115. {n fact, any rehash function that depends solely on the index to 
be rehashed causes primary clustering. 

One way of eliminating primary clustering is to allow i. hash function te 
depend on the number of times that the function is applied to a particular hash value. In 
this approach the function rh is a function of two arguments. rh(i,j) yieids the rehash 
of the integer i if the key is being rehashed for the jth time. One example is shti. j) = 
(i + j) % tablesize. The first rehash yields rhl = rhihikex), 1) = (hikey) +1) % table- 


Searching Chap. 7 
472 


size, the second yields rh2 = (rh) + 2) % tablesize, the third yields rh3 = (rh2 + 3) 
% tablesize, and so on, 

Another approach is to use a random permutation of the numbers between | 
and !, (where f equals tablésize — 1, the largest index of the table), pl, p2, ..., pt, 
and to let the jth rehash of A(key) be (h(key) + pj) % tablesize. This has the advan- 
tage of ensuring that no two rehashes of the same key. conflict. Still a third approach 
is to let the jth rehash of h(key) be (A(key) + sqr(j)) % tablesize. This is called the 
quadratic rehash. ,.' another method of eliminating primary clustering is to allow 
the rehash to depend on the hash value. as in rh(i.kev) = (i + hkey) % tablesize, 
where hkey = 1 + h(key) © : (We cannot use hkey equal to h(key}, which might be. 
0, or to A(key) + 1, which migni be tablesize. Either of these cases would result in 
rh(i, key) equaling i, which is unacceptable). All these methods allow keys that hash 
into different locations to follow separate rehash paths. 

However, although these methods eliminate primary clustering, they do noi elimi- 
nate another phenomenon, known as secondary clustering, in which different keys that 
hash to the same value follow the same rehash path. One way to eliminate all clustering 
is double hashing, which involves the use of two hash functions, Al (key) and h2(key). 
h1, which is known as the primary hash function, is first used to determine the position 
at which the record should be placed. If that position is occupied, the rehash function 
rh(i, key) = (i + h2(key)) % tablesize is used successively until an empty location is 
found. As long as h2(kev1) does not.equal h2(key2). records with keys keyl and key2 
do not compete for the same set of locations. This is true despite the possibility that 
hl(key1) may indeed equal h1(key2). The rehash function depends not only on the in- 
dex to be rehashed but also on the original key. Note that the value h2(kev) does not 
have to be recomputed for each rehash: it need be coniputed only once for each key that 
must be rehashed. Optimally, one should choose functions A1 and 2 that distribute the 
hashes and rehashes uniformly over the interval 0 to rablesize — | and also minimize 
clustering. Such functions are not always easy to find. 

An example of double hashing functions is hl(key) = key % tablesize and 
h2(key) = 1 + key % t, where tablesize is a prime number and t equals tablesize —.1. 
Another example is hl(key) as defined above and h2(key) = 1 + (key/tablesize) % t. 


Deleting Items from a Hash Table 


Unfortunately, it is difficult to delete items from a hash table that uses rehashing 
for search and insertion. For example, suppose that record r1 is at position p. To add a 
record r2 whose key k2 hashes into p, it must be inserted into the first free position from 
among rh(p), rh(rh(y)), .... Suppose that r1 is then deleted, so that position p becomes 
empty. A subsequent search for record r2 begins at position /(k2), which is p. But since 
that position is now empty, the’search process may erroneously conclude that record r2 
is absent from the table. 

One possible solution to this problem is to mark a deleted record as “deleted” 
rather than “empty” and to continue searching whenever a “deleted” position is encoun- 


tered in the course of a search. But this is possible only if there are a small number of 


deletions; otherwise, an unsuccessful search would require « search through the entire 
table because most positions will be marked “deleted” rather than “empty.” Ideally, 
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we would prefer a deletion mechanism in which retrieval time is the same whenever 
n records are in the table, regardless of whether the n records are a result of n inser- 
tions or w insertions and w — n subsequent deletions. Later in this section we examine 


alternatives to rehashing that allow us to accomplish this. 


Efficiency of Rehashing Methods 


The efficiency of a hashing method is usually measured by the average number 
of table positions that must be examined in searching for a particular item. This is 
called the number of probes required by the méthod. Note that in the algorithms we 
have presented, the number of key comparisons equals twice the number of probes, 
since the key at each probe position is compared with both the search argument and 
nullkey. However, the comparison with nullkey may be less expensive than the general 
key comparison. An additional field can also be used in each table position to indicate 
whether it is empty to avoid an extra key comparison. 

Under rehashing, the average number of probes depends on both the hash func- 
tion and the rehash method. The hash function is assumed to be uniform. That is, it is 
assuméd that an arbitrary key is equally likely to hash into any table index as any other. 
Mathematical analysis of the average number of probes required to find an element in a 
hash table if the table had been constructed using a particular hash and rehash method 
can be quite involved. Let n be the number of items currently in the hash table, and let 
tablesize be the number of positions in the table. Then for large tablesize, it has been 
proved that the average number of probes required for a successful retrieval in a table 
organized using linear rehashing is approximately 


2 * tablesize — n + 1 
2 * tablesize —2*n +2 


If we set x = (n — I )/tablesize, this equals (2 — x)/(2 — 2x). Define the load fac- 
tor of a-hash table, /f, as n/tablesize, the fraction of the table that is occupied. Since 
If is approximately equal to x for large tablesize, we may approximate the number 
of probes for a successful search under linear rehashing by (2 — If/(2 — 2 * If) or 
0.5/1 — If) + 0.5. When Uf approximates 1 (that is, when the table is almost full), 
this formula is not useful. Instead, it can be shown that the average number of key 
comparisons for a successful search in an almost full table may be approximated by 
sqrt(m * tablesize/8) + 0.33. ` 

For an unsuccessful search, the average number of probes in a table organized 
using linear rehashing is approximately equal to 0.5/(1 — If)? + 0.5 for large tablesize. 
When the table is full (that is, when n = tablesize — 1, since one position must be 
left open to detect that the key is not present), the average number of probes for an un- 
successful comparison under linear rehashing is (tablesize + 1)/2, which is the same as 
the average number of comparisons required to find a single empty slot among tablesize 
slots by sequential search. 

For tables with low load factors, this performance is not unreasonable, but. for 
high load factors it can be improved considerably. Eliminating primary clustering by 
setting rh(i, key) to (i + hkey) % tablesize as defined previously or by using quadratic 
rehashing sets the average number of probes to approximately 1 — log (1 — /f) — Uff2 for 
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successful retrievals and 1/(1 — If) — If — log (1 — /f) for unsuccessful searches. (Here 
log is the natural logarithm as defined in the standard library math.h.) For full tables, 
successful search time approximates log (tablesize + 1), and unsuccessful search time 
remains at (tablesize + 1)/2. 

Double hashing improves efficiency even further by eliminating both primary and 
secondary clustering. Uniform hashing is defined as any hashing scheme in which any 
newly inserted element is equally likely to be placed at any of the empty positions of the 
hash table. For such a theoretical scheme, it can be proved that successful search time 
is approximately —log (1 — /A/If for large tablesize, and that unsuccessful searching 
requires (tablesize + 1)/(tablesize + 1 — n), or approximately 1/(1 — Jf), probes for 
large tablesize. Experience with good double hashing functions shows that the average + 
number of cpmparisons equals these theoretical values. For full tables, successful search 
time is approximately log (tablesize + 1) — 0.5, and unsuccessful search time is again 
(tablesize + 1)/2. 

The following table lists approximate number of probes for each of the three meth- 
ods for various load factors. Recall that these approximations are generally only valid 
for large table sizes. 


Successful Unsuccessful 
[Ee ae aan cea ee EE fhe ene 6 Nias. Sie = ae 
% Linear i+hkey Double Linear i+ hkey Double 
25 1.17 1.16 1.15 1.39 1.37 1.33 
50 1.50 1.44 1.39 2.50 2.19 2.00 
75 2.50 2.01 1.85 7.50 4.64 4.00 
90 ‘5.50 2.85 2.56 50.50 11.40 10.00 
95 10.50 3.32 3.15 200.50 22.04 20.00 


For full tables (in the successful case, where n equals tablesize; in the unsuc- 
cessful case, where n equals tablesize — 1), the following are some approximations of 
the average number of probes. We have also included the value of log (tablesize) for 
comparison with binary search and tree searching. 


Successful 


Tablesize Linear i+ hkey Double Unsuccessful Log;(tablesize) 


100 6.60 4.62 4.12 50.50 6.64 
500 14.35 6.22 5.72 250.50 7.97 
1,000 20.15 6.91 6.41 500.50 7.97 
5.000 ` 44.64 7.52 7.02 2,500.50 12.29 
10,000 63.00 7.21 7.71 5,000.50 13.29 


These data indicate that linear hashing should be strongly avoided for tables that 
become more than 75 percent full, especially if unsuccessful searches are common, 
since primary clustering does have a significant effect on search time for large load 
factors. The effects of secondary clustering, however, never add more than 0.5 probe to 
the average number required. Given the fact that double hashing requires an expensive 
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additional computation to determine h2(key), it may be preferable to accept the extra 
half probe and use rh(i, key) = (i + hkey) % tablesize. 

One technique that can be used to improve the performance Of linear rehashing 
is split sequence linear rehashing. Under this technique, when h(key) is found to be 
occupied, we compare key with the key kh located in position h(key). If kh < h(key), 
the rehash function į + cl is used: if kh > A(key), another rehash function, i + c2, is 
used. This splits the rehashes from a particular slot into two separate sequences and 
reduces clustering without requiring additional space or reordering the hash table. For 
tables with a load factor of 95 percent, the split sequence technique reduces the number 
of probes in a successful search by more than 50 percent and the number of probes in 
an unsuccessful search by more than 80 percent. However, nonlinear rehash methods 
are still better. A similar technique yields some, but not significant, improvement for 
the nonlinear rehash methods. 

Another point to note regarding efficiency is that, in the context of rehash- 
ing. the modulus operation should not be obtained by using the system % operator, 
which involves division, but rather by a comparison and possibly a subtraction. Thus 
rh(i, kev) = (i + hkev) % tablesize should be computed as follows: 


x= 7 + hkey; 
rh = x < tablesize ? x: x - tablesize; 


The foregoing tables also indicate the great expense of an unsuccessful search in a 
nearly full table. Insertion also requires the same number of comparisons as unsuccess- 
ful search. When the table is nearly full, the insertion efficiency of hashing approaches - 
that of sequential search and is far worse than tree insertion. 


Hash Table Reordering 


When a hash table is nearly full. many items in the table are not at the iocations 
given by their hash keys. Many key comparisons must be made before finding some 
items. If an item is not in the table, an entire list of rehash positions must be examined 
before that fact is determined. There are several techniques for remedying this situation. 

In the first technique, discovered by Amble and Knuth, the set of items that hash 
into the same location are maintained in descending order of the key. (We assume that 
NULLKEY is less than any key possibly occupying the table.) When searching for an 
item, it is not necessary to rehash repeatedly until an empty slot is found; as soon as an 
item in the table whose key is less than the search key is found, we know that the search 
key is not in the table. When inserting a key key, if a rehash accesses a key smaller than 
key. key and its associated record replace the smaller key in the table and the insertion 
process continues with the displaced key. A hash table organized in this way is called 
an ordered hash table. The following is a search and insertion function for an ordered 
hash table. (Recall that NULLKEY is less than any other key.) 


int search(KEYTYPE key, RECTYPE rec) 
{ 
int first, i, j; 


RECTYPE newentry, tempentry; 
KEYTYPE tk; 
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i = h(key); 
newentry.k = key; 
newentry.r = rec; 
first = TRUE; 
while (table[ij.k > newentry.k) 
i = rh(i); 
tk = table{i].k; 
while (tk != NULLKEY & tk != newentry.k) { 
/* inser. .*e new entry and displace */ 
ye the :¿ try at position i */ 
temoentry = table’**: 
table[i] = newentry; 
newentry = tempentry; 
if (first == TRUE) { 
j= i; /* j is the position in which */ 
/* the new record is inserted */ 
first = FALSE; 
} /* end if */ ý : 
j= rh(i); 
tk = table{i}.k; 
} /* end while */ 
if (tk == NULLKEY) { 
table(i] = newentry; 
if (first == TRUE) 
return(i); 
else 
return(j); 
} /* end search */ 


The ordered hash table method can be used with any rehashing technique in which 
a rehash depends only on the index and the key; it cannot be used with a rehash function 
that depends on the number of times the item is rehashed (unless that number is kept 
in the table). 

Using an ordered hash table does not change the average number of key compar- 
isons required to find a key that is in the table, but it reduces «` *nificantly the number 
of key comparisons necessary to determine that a key does not exist in the table. It can 
be shown that unsuccessful search in an ordered hash table requires the same aver- 
age number of probes as successful search (in an ordered or unordered table). This is 
a significant improvement. Unfortunately, however. the average number of probes for 
insertion is not reduced in an ordered hash table and equals the number required for an 
unsuccessful search in an unordered table. Ordered hash table insertions also require a 
significant number of hash table modifications. 


Brent's Method 


A different reordering scheme. atiributable to Brent, can be used to improve 
the average search time for successful retrievals when double hashing is used. The 
technique involves rehashing the search argument until an empty slot is found. Then 
each of the keys in the rehash path is itself rehashed to determine if placing one of those 
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keys in an empty slot would require fewer rehashes. (Recail that, under double hashing, 
the rehash paths for two keys that rehash to the same slot will diverge.) If this is the 
case, the search argument replaces the existing key in the table and the existing key is 
inserted in its empty rehash slot. 

The following is a routine to implement Brent's search and insertion algorithm. 
It uses auxiliary routines setempty, which initializes a queue of table indexes to empty: 
insert, which inserts an index onto a queue: remove, which returns an index removed 
from a queue: and freequeue, which frees all nodes of a queue. 


int search(KEYTYPE key, RECTYPE rec) 
{ 
struct queue qq; /* of table indexes */ 
int i, j, jj, minolcpos, minnewpos; 
int count, mincount, rehashcount, displacecount; 
KEYTYPE displacekey; 


setempty(qq): 
/* rehash repeatedly, placing each successive index */ 
/* in the queue and keeping a count of the number */ 


/* of rehashes required */ 

i = hl(key); 

for (count = 0; table[i].k != key & tableļi].k != nullkey; 
count++) { 


insert(qq, i); 

i =rh(i, key); 
} /* end for */ 
/* minoldpos and minnewpos hold the initial and final */ 
/* indexes of the key on the rehash path of key that */ 


/* can be displaced with a minimum of rehashing. */ 
J" Initially, assume no displacement and set them *7 
J" both to i, the first empty index for key */ 
minoldpos = i; 


minnewpos = 7; 
/* mincount is the minimum number of rehashes of key */ 
/* plus rehashes of the displaced key, displacekey. */ 
/* rehashcount is the number of rehashes of key */ 
7s needed to reach the index of the key being s 
/* displaced. Initially, assume no displacement. */ 
mincount = count; 
rehashcount = 0; 
/* The following loop determines if displacement of */ 
f= the key at the next rehash of key will yield a */ 
/* > lower total number of rehashes. If key was found */ 
/* in the table, then skip the loop */ 
if (table[i].k == nullkey) 
while (!empty(qq) && rehashcount+1 < mincount) { 
j = remove(qq); 
/* the candidate key for displacement */ 
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displacekey = table[j].k; 
jj = rh(j, displacekey); 
/* displacecount is the number of rehashes */ 


Je required to displace displacekey. */ 
for (displacecount = 1; table[jj).k != nullkey; 
displacecount++) 


jj = rh(jj, displacekey); 
if (rehashcount+displacecount <.mincount) { 
mincount = rehashcount + displacecount; 
minoldpos = j; 
minnewpos = jj; 
} /* end if */ 
rehashcount++; 
} /* end while */ 
/* free any extra items on the queue */ 
freequeue(qq); 
/* At this point, if no displacement is necessary ‘*/ 
/* — minoldpos equals minnewpos. minoldpos is the sf 
/* position where key was found or should be inserted. */ 
/* — minnewpos (if not equal to minoldpos) is the */ 
/* position where the key displaced from minoldpos */ 
+ should be placed, */ 
if (minoldpos != minnewpos) 
table[minnewpos] = table[minoldpos]; 
if (minoldpos != minnewpos || table[minoldpos].k == nullkey) { 
table[{minoldpos].k = key; 
table[minoldpos].r = rec; 
} /* end if */ 
return(minoldpos); 
} /* end search */ 


Brent’s method reduces the average number of comparisons for successful re- 
trievals but has no effect on the number of comparisons for unsuccessful searches. Also, 
the effort required to insert a new item is increased substantially. 

An extension of Brent’s method that yields even greater improvements in retrieval 
times at the expense of correspondingly greater insertion time involves recursive inser- 
tion of items displaced inp the table. That is, in determining the minimum number of 
rehashes required to displace an item on a rehash path, all the items on that item’s 
subsequent rehash path are considered for displacement as well, and so on. However, 
the recursion cannot be allowed to proceed to its natural conclusion, since insertion 
time would then become so large as to become impractical, even though insertion is 
infrequent. A maximum recursion depth of 4, plusan ‘litional modification by which 
tentative rehash paths longer than 5 are penalized excessively, has been found to yield 
average retrievals very close to optimal with reasonable efficiency. 

The following table shows the average number of probes required for retrieval and 
insertion under the unmodified Brent algorithm. The last column shows the number of 
retrievals per item required to make Brent's algorithm worthwhile (that is, so that the 
cumulative advantage on retrievals outweighs the disadvantage on insertion). 
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Load factor — Probes/ Probes/ — Breakeven numbers of 


% retrieval insertion retrieval/item 
20 1.10 tS 2.85 
60 1.37 1.92 2.48 
80 1.60 2.97 2.32 
99 1.80 4.27 2.26 
95 1.97 5.84 2.26 


As the table becomes full, approximately 2.5 probes per retrieval are required on the 
average, regardless of the table size. This compares very favorably with ordinary double 
hashing, in which retrieval from a full table requires O(log n) probes. 


Binary Tree Hashing 


Another method of improving Brent's algorithm is attributable to Gonnet and 
Munro and is called binary iree hashing. Again, we assume the use of double hash- 
ing. Every time a key is to be inserted into the table, an almost complete binary tree 
is constructed. Figure 7.4.2 illustrates an example of such a tree, in which the sodes 
are numbered according to the array representation of an almost complete binary 
tree, as outlined in Section 5.2 (that is, nede(0) is the root and node(2 * i + 1) and 
node(2 * i + 2) are the left and right sons of node(i)). (The details of this figure will 
be explained shortly.) Each node of the tree contains an index into the hash table. For 
purposes of this discussion, the hash table index contained in noeli) will be referred to 
as index(i), and the key at that position (that is, ftable[index(|.k) as kqi). key is referred 
to as k(—1). 

To explain how the tee is constructed, we first define the youngest right ancestor 
of node[i], or yra(i), as the node number of the father of the youngest ancestor of nede(1) 
that is aright son. For example, in Figure 7.4.2, yra(11) is O. since nede(11) (containing 
!) is a lett son and its father node(5) (containing f) is also a left son. Thus the youngest 
ancestor of nede(11) that is a right son is node(2) (containing ©), and its father is nede(O). 
Similarly, yra(19) is 1 and yra(17) is 3. If node(i is a right son, vra(i) is detined as the 
node number of its father, (i — 1)/2. Thus vra( 14) in Figure 7.4.2 is 6. If nodeti) has no 
ancestor that is a right son (as, for example, nodes 0, 1. 3, 7, and 15 of Figure 7.4.2), 
yra(i) is defined as — l. 

The binary tree is constructed in node number order. ince (0) is set to Atkey). 
index(i), for each subsequent i, is set to rh(index(i ~ 1)/2). ktyrali))). This process 
continues until &(/) (that is, rable[index(i)].4) equals NULLAKEY and an empty position 
is found in the table. 

For example, in Figure 7.4.2, key is hashed to obtain a = h(xey), which is es-. 
tablished as the index in the root node. Its left son is rh(a, key), and its right son 
isc = rh(a, table(a).k) = rh(a,k(0)). Similarly, the lett sog of bis d = rhib key), the 
right son of b is e = rh(b, fable(b).k) = rh(b.k(1)). the left son of c is 
f = rh(c, table(ay:k) = rh(c,k(0)) and the right son of ¢ is g = rh(c, table(c).k) = 
rh(c,k(2)). This continues until t = rh(j, table(h).k) = rh KE) is placed in nodel 19) 
and u = rh(j, table(j).k) = rh(j,k(9)) is placed in node(20). Since tablefu}.k (which 
is k(20)) is NULLKEY, an empty position in the hash table bas been found, and the 
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1 = rh(f, k(0)) 
m = rh(f, k(5)) 


c = ra, KO) n = rh(g, K(2)) 
d = rh(b, k(-1)) o = rhig, k(6)) 
e = rh(b, k(1)) p = rh(h, k(-1)) 
S = rhc, k(0)) q = rh(h, k(7)) 
& = rh(c, k(2)) r = rhti, k(3)) 
h = rd, k(-1)) s = rhi, k(8)) 
i = rh(d, k(3)) £ = rh, k0) 
J = rhle, k(1)) u = rh(j, k(9)) 
k = rhle, ki4)) k(20) = nullkey 


Figure 7.4.2 


tree construction is completed. Note that any path following a series of left pointers 
through the tree consists of successive rehashes of a particular key and that a right 
pointer indicates that a new key is being rehashed. 

Once the tree has been constructed, the keys along the path from the root to the 
last node are reordered in the hash table. Let be initialized to the position of the 
last node of the tree. Then if yra(i) is nonzero, k(yra(i)) and its associated record are 
moved from table[index(yra(i))] to table[index(i)] and i is reset to yra(i). This pro- 
cess is repeated until yra(i) is —1, at which point key and rec are inserted into ta- 
ble{index(i)] and the insertion is complete. For example, in Figure 7.4.2, yra(20) = 9 
and index(9) = j; consequently the key and record from position j of the hash table are 
moved to the previously empty position x. Then, since yra(9) = 1 and index(1) = b. 
the key and record from position b are moved to position j. Finally, since yra(1) = —1, 
key is inserted into position b. 

When subsequently searching for key. two table positions are probed: a and b. 
When searching for the former sable[b].k (now at table[j}). two additional probes are 
required. When searching for the former rablelj}.k (now at table[u]). one additional 
probe is required. Thus a total of five extra positions over the entire hash table contents 
must be probed as a result of the insertion of key, whereas at least six would have been 
required if key were inserted directly along its rehash path (consisting of a. b. d. h. pP- 
and at least one more position). Similarly. under Brent's method, no path shorter than 


Sec. 7.4 Hashing 481 


length 6 would have been found (considering paths abejt, abdir, abdhg, and abdhp, 
representing attempts to relocate b, d, h, and p, the values on the initial rehash path of a. 
Each of these paths requires one more position before an empty table element is found). 

Note also that if key had previously been inserted in the table using the Gonnet 
and Munro insertion algorithm, it would have been found along the leftmost path of the 
tree (in Figure 7.4.2, abdhp) before an empty table position would have been found at 
a different node. Thus the tree-building process can be initiated, in preparation for a 
possible insertion, as part of the search process. However, if insertions are infrequent, 
it may be desirable to construct the full left path of the tree until an empty position is 
found (that is, to perform a straight search for key) before building the remainder of the 
tree. 

Of course, the entire algorithm depends on the routine yra(i). Fortunately, yra(i) 
may be computed quite easily. yra(i) can be derived directly from the following method: 
Find the binary representation of i + 1. Delete any trailing zero bits and the one bit 
preceding them. Subtract | from the resulting binary number to get yra(i). For example, 
the bingry representation of 11 + 1 is 1100. Removing the trailing 100 yields 1, which is 
the binary representation of 1. Thus yra(11) = 0. Similarly, 17 + 1 in binary is 10010, 
which yields 100, or 4, so that yra(17) = 3; 14 + 1 in binary is 1111, which yields 111, 
or 7, so that yra(14) = 6; and 15 + 1 in binary is 10000 (or 010000), which yields 0, 
so that vra(15) = —1. You may confirm this in Figure 7.4.2. á 

Gonnet and Munro’s method yields results that are even clos :r to optimal than 
Brent’s. However, they are not quite optimal, since the hash table can only be rearranged 
by moving elements to later positions in their hash sequence, never to earlier positions. 
At 90 percent loading, binary tree hashing requires 1.75 probes per retrieval (compared 
with Brent's 1.80), and at 95 percent requires 1.88 (compared with 1.97). For a full 
table, 2.13 probes are required on average, compared with Brent’s 2.5. The maximum 
number of probes required to access an element under Brent's method is O(sqrt(n)), 
whereas under binary hashing it is O(log n). Note that the queue of Brent’s method and 
the tree of Gonnet and Munro’s can be reused for each insertion. If all insertions take 
place initially, and the table is subsequently required only for searches, the space for 
these data structures may be freed. 

If the hash table is static (that is, if elements are initially inserted and the table re- 
mains unchanged for a long series of searches), another reordering Strategy is to perform 
all the insertions initially and then to rearrange the elements of the table to absolutely 
minimize the expected retrieval costs. Experiments show that the minimum expected 
retrieval cost is 1.4 probes per retrieval with a load factor of .5, 1.5 for a load factor of 
.8, 1.7 for a load factor of .95, and 1.83 for a full table. Unfortunately, algorithms to op- 
timally reorder a table to achieve this minimum are O(: * tablesize~) and are therefore 
impractical for large numbers of keys. 


improvements with Additional Memory 
Thus far we have assumed that no additional memory is available in each table 
element. If additional memory is available, we can maintain some information in each 


entry to reduce the number of probes required to find a record or to determine that the 
desired record is absent. 
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Before looking at specific techniques, we should make one observation. The most 
obvious use to which additional memory can be put is to expand the size of thé hash 
table. This reduces the load factor and immediately improves efficiency. Therefore in 
evaluating any efficiency improvements caused by adding more information to each ta- 
ble entry, one must consider whether the improvement outweighs utilizing the memory 
to expand the table. 

On the other hand, the benefit of expanding each table entry by one or two bytes 
may indeed be worthwhile. Each table item (including space for the key and record) 
may require 10, 50, 100, or even 1000 bytes, so that utilizing the space to expand the 
table may not buy as much as utilizing the space for small increments in each table 
element. (In reality, long records would not be kept within a hash table, since empty 
table entries waste too much space. Instead, each table entry would contain the key and 
a pointer to the record. This could still require 30 or 40 bytes if the key were large and 
10 to 15 bytes for typical key sizes.) In addition, for technical reasons (for example, 
word size), not all of the space in a table entry may actually be used, so that there may 
be some extra space available that cannot be us. ` for additional table entries. In that 
case, whatever use can be made of the storage is beneficial. 

The first improvement that we consider reduces the time required for an unsuc- 
cessful search, but not that for a retrieval. It involves keeping with each table element a 
one-bit field whose value is initialized to 0 and is set to | whenever a key to be inserted 
hashes or rehashes to that position but the position is found occupied. When hashing or 
rehashing a key during a search and finding the bit still set to 0, we immediately know 
that the key is not in the table, since if it were, it would either be found in that position 
or the bit would have been reset to 1 when it or some other key had been inserted. Use 
of this technique together with the ordered hash table algorithm of Amble and Knuth 
reduces the average number of probes for an unsuccessful search in a table with a load 
factor of 95 percent from 10.5 to 10.3 using linear rehashing and from 3.15 to 2.2 us- 
ing double hashing. This method is called the pass-bit method, since the additional bit 
indicates whether a table element has been passed over while inserting an item. 

The next method can be used with both linear rehashing and quadratic rehash- 
ing. In both cases we can define a function prb(j, key) that directly computes the po- 
sition of the jth rehash of key, which is the position of the jth probe in searching for 
key. prb(0, key) is defined as h(kev). For linear rehashing [rh(i) = (i + c) % tablesize, 
where c is a constant), prb(j, key) is defined as (h(key) + j*c) % tablesize. For quadratic 
rehashing, prb(j, key) is defined as (h(key) + sqr(j)) % tablesize. Note that no such rou- 
tine can be defined for double hashing: therefore the method is not applicable to that 
technique. 

The method uses an additional integer field, called a predictor, in each table posi- 
tion, Let prd(i) be the predictor field in table position i. Initially, all predictor fields are 
0. Under linear rehashing, the predictor field is reset as follows. Suppose that key 1 is 
being inserted and that j is the smallest integer such that prb(j,kl) is a probe position 
whose predictor field prd(prb(j,k1)).is 0. Then, after k1 is rehashed several more times 
and is inserted in position prb(p,k1), prd(prb(j.k1)) is reset from Oto p - j. Then. during 
a search, when position prb(j.k1) is found not to contain k1, the next position examined 
is prbj + prd(prb(j.k1)), k1) or prb(p,k1) rather than prb(j + 1,41). This eliminates 
p—Jj-— | probes. 


Sec. 7.4 Hashing f 483 


An advantage of this approach is that it can be adapted quite easily when only 
a few extra bits are available in each table Position. Since the predictor field contains 
only the number of additional rehashes needed, in most cases this number is low and 
can fit in the available space. It would be very rare for a predictor value to be greater 
than the largest integer expressible in four or five bits. If only b bits are available for 
the prd field and the’predictor field cannot fit, the field value can be set to 2°— | (the 
largest integer representable by b bits). Then we would skip at least 2°—2 probes after 
reaching such a position. 

Under linear rehashing, suppose two keys, kl and k2, hash into different val- 
ues but the mth probe of one equals the nth probe of the other. [That is, prb(n,kl) = 
prb(m,k2), where n and m are unequal.} Suppose that k1 is inserted first into position 
i = prb(m,k1). Then when k2 is placed in Position prb(m + x, k2), prd(i) is set to x. 
This presents no problem, since prb(n + x,k1) and prb(m + x.k2) both equal (i + x*c) 
% tablesize. Thus anything that hashes into either h(k1) or h(k2) and rehashes into į can 
be referred to i + prd(i) for the next probe. This reflects the fact that linear rehashing 
involves primary clustering in which keys hashing into different locations follow the 
same rehash paths once those paths intersect. 

Under quadratic rehashing, however, primary clustering is eliminated. Thus. the 
paths followed by k1 and k2 differ if h(k1) does not equal A(k2), even after those paths 
intersect. Thus prb(n + x,k1), which equals (A(k1) + sgin + x)) % tablesize, does 
not equal prb(m + x,k2), which equals (i(k2) + sqrim + x)) % tablesize, even though 
prb(n.k1) does happen to equal prb(m.k2). Thus. if prd(prb(i.k1)) is set to x, and if 
we are searching for k2 at prb(j.k2), which happens to equal prb(i,k1), we cannot di- 
rectly go to prb(j + x,k2) unless h(k1) equals /(k2) (that is, kl and &2 are in the same 
secondary cluster) and i equals j. Therefore under quadratic rehashing or any other re- 
hashing method that involves only secondary clustering, we must ensure that A(k(i)) 
equals A(key) before using or setting prd(i) during ii search or insertion of key. If the 
two hash values are unequal. rehashing continues in the usual fashion until a location j 
is reached where h(k(j)) equals h(key), where use of the prd field can be resumed. 

Unfortunately, the predictor method cannot be used at all under double hash- 
ing. The reason for this is that even secondary clustering is eliminated, so that there 
is no guarantee that prb(n + xkl equals prb(n + x,k2) even if A(k1) equals h(k2) and 
prb(n,k1) equals prb(n.k2). 

An extension of the predictor method is the multiple predictor method. Under this 
technique, np predictor fields are maintained in each table position. A predictor hash 
routine ph(key), whose value is between 0 and np — 1, determines which predictor is 
used for a particular Key. The jth predictor in table position i is referenced as prd(ij). 
When a key probes an occupied slot i that equals prb(j. key) such that ph(k(i)) equals 
phikey), the next position probed is prb(j + prd(iph(key)), key), Similarly, if ph(k(i)) 
equals ph(key) and prd(iph(key)) is 0. we know that key is not in the table. If kev is 
inserted at prb(i + x, key). prdliph(key)) is set to x. 

The advantage of the multiple predictor method is similar to the advantages of 
double hashing; it eliminates the effects of Secondary clustering by dividing the list of 
elements that hash or rehash into a particular location into np separate and shorter lists. 

Sirwtation results of a slightly modified version of the predictor method using 
the quadratic rehash method are shown in the following table, which lists the average 
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number of probes required for a successful search under various load factors, with var- 
ious numbers of predictor fields and numbers of bits in each predictor. By comparison, 
recall that quadratic hashing without predictors required 1.44 average probes for a 50 
percent load factor and 2.85 for 90 percent, and that double hashing required 1.39 and 
2.56 probes, respectively. 


Load Bits in each predictor 
Number of factor 


predictors Ge 3 4 5 
o 
1 501.25) 125 125 
7 139 135 135 
90 183 1.55 1.46 
2 SO 124 IB 123 
7 135 132 1.3) 
90 4.79 150 14 
4 w wa iA i23 
70 133 130 130 
90 1.74 147 138 
8 W% 2 i2 2 
7 1.32. 129 129 


90 1.72. 1:46. 1.31 


As the number of bits in each predictor and the number of predictors grow very 
large, the average number of probes required for a successful search with a load factor if 
becomes 2 — (1 — exp(—If))/If. For a single full-integer predictor, the average number 
of probes is 1 + /f/2. The predictor method also reduces the average number of probes 
for unsuccessful searches. 


Coalesced Hashing 


Perhaps the simplest use of additional memory to reduce retrieval time is to add a 
link field to each table entry. This field contains the next position of the table to examine 
in searching for a particular item. In fact, under this method a rehash function is not 
required at all; the technique is therefore our first example of the second major method 
of collision resolution, called chaining. This method uses links rather than a rehash 
function to resolve hash clashes. 

The simplest of the chaining methods is called standard coalesced hashing. A 
search and insertion algorithm for this method can be presented as follows. Assume 
that each table entry contains a key field k initialized to NULLKEY and a next field 
initialized to — 1. The algorithm uses an auxiliary function gerempn, which returns the 
index of an empty table location. 


i = h{key); 

while (k(i) != key & next(i) >= 0) 
7 = next(i); 

if (k(i) == key) 
return(i); 


Sec. 7.4 Hashing j 485 


/* set j to the position where the new */ 
/* record is to be inserted */ 
if (k(i) == NULLKEY) 
/* the hash position is empty */ 
J=1; 
else { á 
j = getempty(); 
next(i) = j; 
} /* end if */ 
k(j) = key; 
r(j) = rec; 
return(j); 


The function getempry can use any technique to locate an empty position. The simplest 
method is to use a variable avail initialized to tablesize — | and to execute 


while (k(avaiT) != NULLKEY) 
avail--; 
return(avail); 


each time that getempry is called. When getempty is called, a}! table positions between 
avail and tablesize — 1 have been already allocated. getempry sequentially examines 
positions less than avail to locate the first empty position. avail is reset to that position, 
. which is then allocated. Of course, it may be desirable to avoid the nullkey comparisons 

in getempty. This can be done in a number of ways. An additional one-bit empty field 
can be added to each table position, or the next fields can be initialized to — 2 and 
modified to — 1 when keys are inserted in the table. An alternative solution is to link 
the free positions together in a list that acts as a stack. 

Figure 7.4.3 illustrates a table of ten elements that has been filled using standard 
coalesced hashing using the hash function key % 10. The keys were inserted in the 
order 14, 29, 34, 28, 42, 39, 84, and 38. Note that items hashing into both 4 and 8 have 
coalesced into a single list (in positions 4, 8, 7, 5, and 3, containing items 14, 34, 28, 
84, and 38). This is how the method gets its name. 

There are several advantages to standard coalesced hashing. First, it reduces 
the average number of probes to approximately exp (2 * /f)/4 — [ff2 + 0.75 for an un-. 
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successful search, and to approximately (exp (2 * If) — 1)/(8 * If) + Iff4 + 0.75 for 
a successful search. (exp (x), available in the standard library math.h, computes the 
value of e*.) This compares favorably with all previous methods other than the predic- 
tor method. A full table requires only an average of 1.8 probes to locate an item, and 
2.1 probes to determine that an item is not in the table. A fraction of approximately 
1 — [ff2 of items in the table can be found on the first probe. 

Another major advantage of chaining methods is that they permit efficient dele- 
tion without penalizing the efficiency of subsequent retrievals. An item being deleted 
can be removed from its list, its position in the table freed, and avail reset to the 
following position (unless avail already points to a position later in the table). This 
may slow down the second subsequent insertion somewhat by forcing avail to be 
repeatedly decrementexu through a long series of occupied positions, but that is not 
very significant. If the free table positions are kept in a linked list, this penalty also 
disappears. : 

A variation of standard coalesced hashing inserts a new element into its chain 
immediately following the item at its hash location rather than at the end of the chain. 
This technique, called early insertion standard coalesced hashing, or EISCH, requires 
the samé average number of probes as ordinary standard coalesced hashing for an un- 
successful search, but fewer probes (approximately (exp (If) — 1)/If) for a successful 
search. In a full table, EISCH requires approximately 5 percent fewer probes for a suc- 
cessful search. 

A generalization of the standard coalesced hashing method, which we call general 
coalesced hashing, adds extra positions to the hash table that can be used for list nodes 
in the case of collisions, but not for initial hash locations. Thus the table would consist 
of t entries (numbered 0 to t — 1), but keys would hash only into one of m < t values 
(0 to.m — 1). The extra £ — m positions are called the cellar and are available for storing 
items whose hash positions are full. ; 

Using a ceNar results in less conflict between lists of items with different hash 
values and therefore reduces the lengths of the lists. However, a cellar that is too large 
could increase-tst lengths relative to what they might be if the cellar positions were 
permitted as hash locations. For full tables, lowest average successful search time is 
achieved if the ratio m/t is 0.853 (that is, approximately 15 percent of the table is used 
as a cellar). In that case the average number of probes is only 1.69. Lowest average 
unsuccessful search time is attained if the ratio is 0.782, in which case only 1.79 probes 
are required for the average unsuccessful search. Higher values of m/t produce lowest 
successful and unsuccessful search times for lower load factors (when the table is not 
full). - 

Unlike the situation with standard coalesced hashing, the early insertion method 
yields worse retrieval times than if elements are added at the end of the chain in gen- 
eral coalesced hashing. A combination of the two techniques, called varied insertion 
coalesced hashing, seems to yield the best results. Under this method a colliding el- 
ement is ordinarily inserted in the list immediately following its hash position, as in 
the early insertion method, unless thé list emanating from that position contains a cellar 
element. When that occurs, the varied insertion method inserts the collider after the last 
cellar position in the chain. However, the extra overhead of varied insertion and early 
insertion often make them less useful in practice. 


Sec. 7.4 Hashing 487 


Separate Chaining 


Both rehashing and coalesced hashing assume fixed table sizes determined in 
advance. If the number of records grows beyond the number of table Positions, it is 
impossible to insert them without allocating a larger table and recomputing the hash 
values of the keys of all records’ already in the table using a new hash function. (In 
general coalesced hashing, the old table can be copied into the first half of the new 
table and the remaining portion of the new table used to enlarge the cellar, so that items 
do not have to be rehashed.) To avoid the possibility of running out of room, too many 
locations may be initially allocated for a hash table. resulting in much wasted space. 

Another method of resolving hash clashes is called Separate chaining and in- 
volves keeping a distinct linked list for all records whose keys hash into a particular 
value. Suppose that the hash routine produces values between 0 and tablesize — 1. Then 
an array bucket of header nodes of size tablesize is declared. This array is called the 
hash table. bucket{i} points to the list of all records whose keys hash into i. In searching 
for a record, the list head bucket{i] is accessed and the list that it initiates is traversed. 
If the record is not found, it is inserted at the end of the list. Figure 7.4.4 illustrates 
separate chaining. We assume a ten-element array and the hash routine key % 10. The 
keys if that figure are presented in the order 


75 66 42 192 91 40 49 87 67 16 417 130 372 227 


We may write a search and insertion function for separate chaining using a hash 
function h, an array bucket, and nodes that contain three fields: k for the key, r for the 


record, and next as a pointer to the next node in the list. getnode is used to allocate a 
new list node. 


Struct nodetype *search(KEYTYPE key, RECTYPE rec) 
{ 3 
struct nodetype *p, *q, *s; 


i = h(key); 
q = NULL; 
p = bucket[i]; 
_ While (p != NULL && p->k != key) { 
q =p; 
p = p->next; 
} /* end while */ 
if (p->k == key) 
return(p); 
/* insert a new record */ 
s = getnode(); 
` S->k = key; 
S->r = rec; 
S=>next = NULL; 
if (q == NULL) 
bucket[i] = s; 
else 
q->next = s; 
return(s); 
} /* end search */ 
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Note that the lists may be reordered dynamically for more efficient searching by the 
methods of Section 7.1. The time for unsuccessful searches can be reduced by keeping 
each of the lists ordered by key. Then only half the list need be traversed on the average 
to determine that an item is missing. 
The primary disadvantage of chaining is the extra space that is required for the 
hash table and pointers. However, the initial array is usually smaller in schemes that 
use separate chaining than in those that use rehashing or coalesced hashing. This is 
because under separate chaining it is less catastrophic if the entire array becomes full; 
it is always possible to allocate more nodes and make them available to various lists. Of 
course, if the lists become very long, the whole purpose of hashing—direct addressing 
and resultant search efficiency—is defeated. One advantage of separate chaining is 
that the list items need not be in contiguous storage. Another advantage over all other 
hashing methods is that separate chaining allows traversal of the items in hash-key 
order, although not in sequential key order. 
There is a technique that can be used to save some space in seperate chain- 
ing. Consider the hash routine h(key) = key % tablesize. Then if we define p(key) 
= key/tablesize, we can store p(key) rather than key in the k field of each list node. 
When searching for key, we compute h(key). Since key equals p(key) * tablesize + 
_ h(key), we can recompute the key value in each list node from the value of p(key) 
stored in the k field. Since p(key) requires less space than key, space is saved in each 

“ list node. The technique can be used for separate chaining but not for any of the 
other hashing methods because only in separate chaining is the value of h(key) for the 
key stored in a particular position being probed always known with certainty during 
the search process. However, this technique is not of much use for integer keys in a 
language like C in which the size of an integer is fixed by the computer implemen- 
tation. i 

Another space-efficiency decision that must be made is whether the hash table 
should be simply list headers (as we have presented it) or whether each list header 
should itself also contain a key and a record (or record pointer). Space would be wasted 
for empty array items but gained for full ones. 

The average number of probes required to locate an existing item under the sepa- 
rate chaining technique is approximately 1 + /f/2. The average number required for an 
unsuccessful Search is approximately exp(—/f) + If if the lists are not kept ordered, and 
approximately 1 + /f/2 — (1 — exp(—lf If + exp(—If) if they are kept ordered. If there 
are tablesize keys and tablesize elements in the hash table (for a load factor of 1), this 
translates to 1.5 average probes per successful search, 1.27 probes for an unsuccessful 
search if unordered lists are used, and 1.05 probes for an unsuccessful search if ordered 
lists are used. Note that the average for an unsuccessful search is actually lower than for 

"a successful search, since when an array element is empty an unsuccessful search to that 
element involves zero probes, whereas a successful search requires at least one probe. 
(This assumes that a key is not képt in each element, since an unsuccessful search to 
an empty element Would then require a key comparison with NULLKEY.) 

One technique that can be used to reduce the number of probes in separate chain- 

- ing is to maintain the records that hash into the same value as a binary search tree 

emanating from the hash bucket rather than as a linked list. However, this requires two 
pointers to be kept with each record. Since chains are usually small (otherwise, the init- 
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ial bucket table should be larger), the added space and programming complexity do not 
seem to be warranted. 

Although the average number of probes for separate chaining appears to be quite 
low, the numbers are deceiving. The reason for this is that the load factor is detined as 
the number of keys in the table divided by the number of positions in the table. But, 
in separate chaining, the number of positions in the hash table is not a valid measure 
of space utilization, since the keys are not stored in the hash table but in the list nodes. 
Indeed, the formulas for search time remain valid even it /f is greater than 1. For exam- 
ple, if there are five times as many keys as buckets, If = 5, and the average number of 
probes for a successful search is 1 + /f/2 or 3.5. Thus the total space allocated to the hash 


“table should be adjusted to include the list nodes. When this is done, coalesced hash- 


ing is quite competitive with separate chaining. Note also that rehashing with multiple 
predictors also performs better than separate chaining. 


Hashing in External Storage 


If a hash table is maintained in external storage on a disk or some other direct- 
access device, time rather than space is the critical factor. Most systems have sufficient 
external storage to allow the luxury of unused allocatea space for growth but cannot 
afford the time needed to perform an I/O operation ror every element on a linked list. In 
such a situation the table in external storage is divided into a number of blocks called 
buckets. Each bucket consists of a useful physical segment of external Storage such as 
a page or a disk track or track fraction. The buckets ure usually contiguous and can be 
accessed by bucket offsets from 0 to zablesize — | that serve as hash values, much like 
indexes of an array in internal storage. 

Alternatively, one or more contiguous storage blocks can be used’as a hash table 
containing pointers to buckets distributed noncontiguously. In that situation the hash 
table is most likely read into memory as soon as the file is opened (or upon the first record 
read) and remains in memory until the file is closed. When a record is requested, its key 
is hashed and the hash table (now in internal memory) is used to locate the external 
storage address of the appropriate bucket. Such a hash table is often called an index 
(not to be confused with the term “index” as used to refer to a particular table position). 

Each external memory bucket contains room for a moderate number of records {ia 
practical situations, from 10 to 100). An entire bucket is read into memory at once and 
sequentially searched for the appropriate record. (Of course, a binary search or some 
other appropriate search mechanism based on the internal organization of the records 
within the bucket can be used, but the number of records in a bucket is usually small 
enough that no significant advaatage is gained.) 

We should note that when dealing with external storage, the computational efi- 
ciency of a hash function is not as important as its success at avoiding hash clashes. It 
is more efficiegt to spend microseconds computing a complex hash function at inter- 
nal CPU speeds than milliseconds or longer accessing additional buckets at 1/0 speeds 
when a bucket overflows. We also note that external storage space is usually inexpen- 
sive. Thus, the number of contiguous initial buckets or the size of the hash table should 
be chosen so that it is unlikely that any cf the buckets become full, even though this en- 
tails allocating unused space. Then when a new record must be inserted. there usually 
is room in the appropriate bucket, and an additional expensive I/O is not required. 
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If a bucket is full and a record must be inserted, any of the rehash or chaining 
techniques discussed previously can be used. Of course, additional 1/0 operations are 
required when searching for records that are not in the buckets directly corresponding 
to the hash value. The size of the hash table (which equals the number of buckets that 
are accessible in one I/O operation) is crucial. A hash table that is too large implies that 
most buckets will be empty, and a great deal of space is wasted. A hash table that is 
too small implies that buckets will be full, and large numbers of I/O operations will be 
required to access many records. If a file is very volatile, growing and shrinking rapidly 
and unpredictably, this simple hashing technique is inefficient in either space or time. 
We will see how to deal with this situation shortly. 

The following table indicates the expected number of external storage accesses 
per successful search under linear rehashing, double rehashing, and separate chaining 
for various bucket sizes and load factors (the load factor is defined as the number of 
records in the file divided by the product of the number of buckets and the bucket size). 


ee 
Bucket Lvad Linear Double Separate 
size factor rehashing hashing chaining 


1 S 1.500 1.386 1.250 
8 3.000 2.012 1.400 
95 10.5 3.153 1.5 

5 3 1.031 1.028 1.036 
8 1.289 1.184 1.186 
95 2.7 1.529 1.3 

10 5 1.005 1.005 1.007 
8 1.110 1.079 1.115 
95 1.8 1.292 1.3 

50 5 1.000 1.000 1.000 
& 1.005 1.005 1.015 
95 11 1,067 1.2 


This table indicates that double hashing is the preferred method for moderate or large 
bucket sizes. 

However, when dealing with external storage such as a disk, the number of buck- 
ets that have to be read from external storage is not the only determinant of access 
efficiency. Another important factor is the dispersal of the buckets accessed—that is. 
how far apart the buckets accessed are from each other. In general, a major factor in 
the time it takes to read a block from a disk is the seek time. This is the time it takes 
for the disk head to move to the location of the desired data on the disk. If two buckets 
accessed one after the other are far apart, more time is required than if they are close 
together. Given this fact, it would seem that linear rehashing is the most effective tech- 
nique because althpugh it may require accessing more buckets, the buckets it a¢cesses 
are contiguous (assuming that c = 1 in the linear rehash so that if a record is not in 
a full bucket, the next sequential bucket is checked). Surprisingly, the table indicates 
that fewer buckets are accessed under linear rehashing than under separate chaining for 
large bucket sizes. 
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If separate chaining is used, it is desirable to reserve an overflow area in each 
cylinder of the file so that full buckets in that cylinder can link to overflow records in 
the same cylinder, thus minimizing seek time and essentially eliminating the dispersal 
penalty. It should be noted that the overflow area need not be organized into buck- 
ets and should be organized as individual records with links. In general. few records 
overflow, and there is only a small chance that sufficiently many will overflow from a 
single bucket to fill an additional complete bucket. Thus, by keeping individual over- 
flow records, more buckets can overflow into the same cylinder. Since space is reserved 
within the file for overflow records, the load factor docs not represent a true picture of 
Storage utilization for this version of separate chaining. Thé number of accesses in sepa- 
rate chaining is therefore higher for a given amount of external storage than the numbers 
in the foregoing table would indicate. 

Although double hashing requires fewer accesses than linear rehashing, it dis- 
perses the buckets that must be accessed to a degree that may overwhelm this advan- 
tage. However, in systems where dispersal is not a factor, double hashing is preferred. 
This is true of modern large multiuser systems in which many users may be requesting 
access to a disk simultaneously. and the requests are scheduled by the operating system 
based on the way the data is arranged on the disk. In such situations, waiting time for 
disk access is required in any case, ṣo that dispersal is not a significant factor. 

The major drawback in using hashing for external file storage is that sequential 
access (in ascending key order) is not possible, since a good hash function disperses 
keys without regard to order. Access to records in key-sequential order is particularly 
important in external file systems. 


Separator Method 


One technique for reducing access time in external hash tables at the expense of 
increasing insertion time is attributable to Gonnet and Larson. We will call the tech- 
nique the separator method. The method uses rehashing (either linear rehashing or 
double hashing) to resolve collisions but also uses an additional hash routine, s, called 
the signature function. Given a key key. let h(key, i) be the ith rehash of key, and let 
s(key, i) be the ith signature of key. If a record with key key is stored in bucket number 
h(key, j). the current signature of the record and the key, sig(kev), is defined as 
stkey, j). That is. if a record is placed in a bucket corresponding to its key`s jth rehash, 
its current signature is its key’s jth signature. : 

A separator table, sep, is maintained in internal memory. If b is a bucket number, 
sepib) contains a signature value greater than the current signature of every record in 
bucket(b), To aceesgthe record with key key, repeatedly hash key until obtaining a value 
j such that sep(h(ey, j)) > stkev. j). At that point, if the record is in the file it must be 
in bucket(h(key,j, )). This ensures the ability to access any record in the file with only a 
single extemal memory access. 

If m is the number of bits allowed in each item of the separator table (so that it can 
hold values between 0 and 2”"— 1), the Signature function, s, is restricted to producing 
Values between 0 and 2”"— 2. Initially, before any overflows have occurred in bucket 
b, the value of sep(b) is set to 2"— 1, so that any record whose key hashes to b can be 
inserted directly into buckenb) regardless of its signature. Now, suppose that bucket(b) 
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is full and a new record to be inserted hashes into b. [That is, h(key, j) equals b, and j 
is the smallest integer such that sep(h(key, j)) > s(key, j).] Then the records in b with 
the largest current signature Ics must be removed from bucket(b) to make room for the 
new record. The new record is then inserted into bucket(b) and the old records that 
had current signature /cs and were removed from bucket b are rehashed and relocated 
into new buckets (with new current signatures, of course). sep(b) is then reset to Ics, 
since the current signatures of all records in bucket(b) are less than /cs. Future keys 
are directed to bucket(b) only if their signatures are less than /cs. Notice that more than 
one record may have to be removed from a bucket if they have equal maximal current 
signature values. This may leave a bucket with some remaining space after an insertion 
causes it to overflow. 

Records that overflow from a bucket during an insertion may cause cascading 
overflows in other buckets when attempting to relocate them. This means that an inser- 
tion may cause an indefinite number of additional external storage reads and writes. In 
practice, a limit is placed on the number of such cascading overflows beyond which the 
insertion fails. If the insertion does fail, it is necessary to restore the file to the status it 
was in before inserting the new record that caused the original overflow. This is usually 
done by delaying writing modified buckets to external storage, keeping the modified 
versions in internal memory until it is determined that the insertion can be completed 
successfully, If the insertion is aborted because the cascade limit is reached. no writes 
are done. leaving the file in its original stace. 

With 40 buckets per record, four-bit signature values and a load factor of 90 per- 
cent. an average of no more than two pages need be modified per insertion under this 
method. However, the number of modified pages per insertion rises rapidly as the load 
factor is increased, so that the technique is impractical with a load factor greater than 95 
percent. Larger signature values and larger bucket sizes permit the method to be used 
with larger load factors. 


Dynamic Hashing and Extendible Hashing 


One of the most serious drawbacks of hashing for external storage is that it is 
insufficiently flexible. Unlike internal data structures, files and databases are semiper- 
manent structures that are not usually created and destroyed within the lifetime of a 
single program. Further, the contents of an external storage structure tend to grow and 
shrink unpredictably. All the hash table structuring methods that we have examined 
have a sharp space/time trade-off. Either the table uses a large amount of space for 
efficient access, resulting in much wasted space when the structure shrinks, or it uses 
a small amount of space and accommodates growth very poorly by sharply increas- 
ing the access time for overflow elements. We would like to develop a scheme that 
does not utilize too much extra space when a file is small but permits efficient ac- 
cess When it grows larger. Two such schemes are called dynamic hashing, attributable 
to Larson, and extendible hashing, attributable to Fagin, Nievergelt, Pippenger, and 
Strong. 

The basic concept under both methods is the same. Initially. m buckets and a 
hash table (or index) of size m are allocated. Assume that m equals 2°, and assume a 
hash routine / that produces hash values that are w > b bits in length. Let hLikey) be 
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the integer between 0 and m represented by the first b bits of A(key). Then, initially, hb 
is used as the hash routine, fand records:are inserted into the m buckets as in ordinary 
external storage hashing. : 

When a bucket overflows, the bucket is split in two and its records are assigned 
to the two new buckets based on the (b + 1)st bit of h(key). If the bit is 0, the record 
is assigned to the first (or left) new bucket: if the bit is 1, the record is assigned to the 
second (or right) bucket. (Of course. the original bucket can be reused as one of the 
two new buckets.) The records in each of the two new buckets now all have the same 
first b + 1 bits in their hash keys. h(key). Similarly, when a bucket representing ` bits 
overflows (where b <= i <= w), the bucket is split and the (i + 1)st bit of htkev) for 
each record in the bucket is used to place the record in the left or ri ght new bucket. Both 
new buckets then represent i + | bits of the hash key. We call the bucket whose keys 
have 0 in their (i + 1)st bit the 0-bucket and the other bucket the J-bucket. 

Dynamic hashing and extendible hashing differ as to how the index is modified 
when a bucket splits. Under dynamic hashing, each of the m original index entries 
represents the root of a binary tree each of whose leaves contains a pointer to a bucket. 
Initially each tree consists of only one node (a leaf node) that points to one of the m 
initially allocated buckets. When a bucket splits. two new leaf nodes are created to 
point to the two new buckets. The former leaf that had pointed to the bucket being split 
is transformed into a nonleaf node whose left son is the leaf pointing to the 0-bucket 
and whose right son is the leaf pointing to the |-bucket. Dynamic hashing with b = 2 
(m = 4) is illustrated in Figure 7.4.5. _ 

To locate a record under-dynamic hashing, compute h(key) and use the first b bits 
to locate a root node in the original index. Then use each successive bit of h(key) to 
move down the tree, going left if the bit is O and right if the bit is 1, until a leaf is 
reached. Then use the pointer in the leaf to locate the bucket that contains the desired 
record, if it exists.’ F 

» In extendible hashing, each bucket contains an indication of the number of bits of 
h(key) that determine which records are in that bucket. This number is called the bucket 
depth. Initially. this number is b for all bucket entries: it is increased by | each time a 
bucket splits. Associated with the index is the index depth, d. which is the maximum 
of all the bucket depths. The size of the index is always 24 (initially, 2). 

Suppose that a bucket of depth / is to be split. Let al.a2,.... ai (where each qj is 
either 0 or 1) be the first i bits of A(key) for the records in the bucket being split. There 
are two cases to consider: į < d and i = d. If i < d (so that the bucket depth is being 
increased to i + 1 but the index depth remains at d), all index positions with bit values 


C E 4 ai00 ...0 (up to a bit size of d) through al, a2. .... aiO1 ... 1 of the index 
(that is, all positions starting with al, .... ai0) are reset to point to the 0-bucket, and 
index positions with bit values al.a2...., ail0...0 through al.a2,.... alate, l 


position al... adO is set to point to the new 0-bucket. and the contents of index position 
al... ad! to point to the new 1-bucket. Extendible hashing is illustrated in Figure 7.4.6. 
Figure 7.4.6a illustrates a configuration with index depth 4. Figure 7.4.6b illustrates an 
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Figure 7.4.5 Dynamic hashing with b = 2. 


£ 2 8 9 4 5 
TE “Po j 
| (b) Bucket 3 overflows ; 


Depth = $ 


— 


O 123 4 56 7 8 9 10 1) 12-13 16 15 16 17 18 19 20 21 22 23 24. 25 26°27 28 29 30 31 
ett ELT a ct baal el et | 


| : 

i 

at 2 8 7 

| C A CT “[loce.] "foe "oor ‘Po E 
j d (c) Bucket $ overflows i 


Figure 7.4.6 Extendible hashing 


overflow that does not increase the index depth, and Figure 7.4.6c illustrates an overflow 
that does. J 

To locate a record under extendible hashing, compute h(key) and use the first d 
bits (where d is the index depth) to obtain a position in the index. The contents of this 
position point to the. bucket containing the desired record, if it exists. 

Under both dynamic and extendible hashing, if the entire index is maintained in 
internal storage, only one I/O operation is required to locate a record regardless of how 
large the file grows. When a file shrinks, buckets can be combined and freed and the 
index size-can be reduced. Thus these methods achieve the twin goals of efficient space 
utilization and efficient access. Both schemes also allow effective sequential traversal 
of records in hash-key order. : 

However, neither method permits traversal in key order, and this often prevents 
the practical use of the techniques for file implementation. Of course, one could use 
the key itself as a hash value or some other order-preserving hash function, but such 
functions are usually nonuniform. The lack of uniformity is not as serious an obstacle 
under these methods as under static hashing methods, since any number of bits can 
be used. The more bits used, the less likely that two keys clash. Although too large a 
number of bits can result in too large an index for practical use, dynamic hashing, which 
does not use as large an index as extendible hashing, may indeed be practical with a 
nonuniform hash function. 

A simple variation of extendible hashing. in which the last bits of the hash key 
rather than the first are used to locate a bucket, is also possible. Such a variation sim- 
plifies doubling the index, since it allows merely copying the first half of the new index 
into the second and only modifying the two entries pointing to the two new buckets. 
However. such a scheme would not permit traversal even in hash-key sequence. 

One suggestion for a hashing technique for use with these methods is to use a ran- 
dom number generator to produce an arbitrarily long sequence of Os and 1s as needed, 
with the key or some function thereof as the seed. The same sequence would be pro- 
duced for the same key’every time, but there is no limit to the extendibility. This has 
the advantage of allowing the file to grow arbitrarily large, and in the case of dynamic 
hashing, to ensure balanced trees. 

` In comparing dynamic and extendible hashing, we note that extendible hashing 
is more time efficient, since a tree path need not be traversed as in dynamic hashing. 
However, if the entire index is kept in memory, the time spent in traversing the tree path 
does not involve any I/Os. Traversal time is therefore likely to be negligible compared 
with the time for accessing the bucket. The maximum number of tree nodes required in 
dynamic hashing is 27 — 1, assuming n buckets, whereas there may be as many as 2”~! 
index entries required under extendible hashing. However. usually fewer than twice as 
many extendible hashing index entries as dynamic hashing tree nodes are required, and 
the tree nodes require two pointer fields compared with one for each extendible hashing 
index entry. Thus the two methods are comparable in average internal space utilization. 

It is also possible to compress very large extendible hashing indexes by keeping 
only one copy of each bucket pointer and maintaining from/to indicators. Another point 
to note is that extendible hashing performs the same way regardless of the value of m. 
the initial number of index entries, whereas dynamic hashing requires longer tree paths 
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if m is smaller. In fact, there is no reason Not to initialize m to O (that is, b = 0) with a 
single empty bucket in extendible hashing, other than for contiguity of. the buckets in 
external storage. 

External storage utilization of both dynamic and balanced hashing is approxi- 
mately 69 percent on the average, which is the same as is achieved with B-trees. How- 
ever, the storage utilization of the hashing methods oscillates far more sharply and 
persists longer than for B-trees, so that there is a period of low utilization (approxi- 
mately 50 percent) after buckets are split as they begin filling up. This is followed bya 
period of high utilization (approaching 90-100 percent) as new records are uniformly 
distributed into the buckets and they all become full more or less simultaneously. Fi- 
nally, a short period of intensive Splitting is observed, after which utilization is again 
low. i 

The reason for this oscillation is that the hash routine is expected to be uniform so 
that all buckets are filled at approximately the same time. It may be desirable to mini- 
mize this oscillation by purposely introducing som- nonuniformity in the hash function. 
However, if this is done in extendible hashing, the nonuniformity could cause extremely 
large indexes. This problem can be solved by keeping a compressed version of the index 
as noted. 

To achieve storage utilization higher than 69 percent, it is possible to utilize over- 
flow buckets. When a bucket becomes full and an additional record is to be inserted, an 
overflow bucket is allocated and linked to the full bucket. When the overflow bucket 
also fills up, both buckets’ contents are redistributed into two nonoverfiow buckets, 
with the overflow bucket linked to the new bucket with more than half of the records. 
This results in more than two full buckets of data distributed into three buckets (two 
regular, one overflow), yielding a minimum utilization of 67 percent, rather than 50 
percent, once the initial buckets are filled. Of Course, it is also possible to use over- 
flow buckets that are smaller than regular buckets (in which case not every split results 
in an overflow bucket remaining allocated), as well as to allow an overflow bucket to 
contain records that have overflowed from more than one bucket. This latter technique, 
however, complicates traversal in hash-key order. 

Another method of increasing space utilization is to allow overflow records to be 
placed in a full bucket’s brother (under extendible hashing, the brother of a 0 bucket 
is its Corresponding 1 bucket and vice versa). This method also complicates traversal 
in hash-key order. Both the use of overflow buckets and the use of a brother bucket 
for overflow records increase the average number of accesses required for a search. 
However, if brother buckets are kept contiguous, this penalty may not be great. 


Linear Hashing 


One drawback of dynamic and extendible hashing is the need for an index. Al- 
though the index may be kept in internal Storage once the file is opened, this is not 
always possible ifthe index becomes very large. Also. the index does require external 

: Storage when the file is not in use. In addition, the-external copy of the index may have 


to be constantly updated to guard against power failure or other interruption that would 
prevent rewriting the index when the file is closed. 
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Another technique, linear hashing (not to be confused with lingar rehashing), 
proposed by Litwin and modified by Larson, permits a hash table to expand and shrink 
dynamically without requiring an index. However, the basic technique does require the 
use of overflow. buckets, unlike dynamic and extendible hashing. The version that we 
present is called linear hashing with two partial expansions, or LH2P. 

Under LH2P the initial file consists of m buckets, where m is even, numbered 
0 to m — 1. The file is considered divided into ng groups of buckets, numbered 0 to 
ng — 1. Initially, there are m/2 groups of buckets (ng = m/2), each consisting of two 
buckets. Group i initially consists of buckets i and i + ng. For example, if m equals 6, the 
file initially contains six buckets, numbered 0 to 5. There are three groups: group 0 con- 
tains buckets 0 and 3, group 1 contains buckets 1 and 4, and group 2 contains buckets 2 
and 5. ` l 

The file grows in two ways: overflow growth and regular growth. Unlike B-trees, 
dynamic hashing, or extendible hashing, a bucket is not split when it overflows. Instead, 
an overflow mechanism is used to contain the overflowing records from a particular 
bucket. This mechanism can utilize any of the techniques discussed earlier. 

Regular growth takes place by expanding the size of the file by one bucket at 
a time. Expansion of the file by one bucket is called a simple expansion. A sim- 
ple expansion takes place whenever the load factor (defined as the total number of 
records in the file divided by the number of records that fit into regular, nonoverflow 
buckets) exceeds a threshold percentage. When a simple expansion takes place, the 
number of regular buckets in the file, nb, increases by 1. At any time, the file con- 
sists of buckets O through nb — 1 plus any overflow buckets or records. Initially, nb 
equals m. ` l 

Regular growth under LH2P takes place in a series of simple expansions, grouped 
into partial expansions and full expansions. Each full expansion doubles the number of 
regular buckets in the file and consists of two partial expansions. The first partial expan- 
sion increases the number of regular buckets by 50 percent, and the second increases the 
number by the same amount. Thus, after the first partial expansion, nb equals 3 * m/ 2; 
after the first full expansion, nb equals 2 * m; and after the second full expansion, nb 
equals 4 * m. 

Each simple expansion increases the size of a particular group of buckets by 
one bucket. The variable nextgroup always holds the number of the next group 
to be expanded (initially, nextgroup, is 0). During the first partial expansion, two- 
bucket groups are expanded to three buckets by moving some records from buck- 
ets nextgroup and nextgroup + ng (and some of their associated overflow records) 
into bucket nextgroup +2 * ng. (Exactly which records are moved to the new bucket 
and which remain in place will be discussed shortly.) Note that the buckets are num- 
bered from 0 to nb — 1, so that nb is always the number of the next bucket added . 
to the file. During the first partial expansion nb always equals nextgroup +2 * ng. 
(Initially, nexrgroup = 0 and ng = m/2, so nb = m.) After a group has been ex- 
panded, nextgroup and nb are both increased by 1, and the next group is ready for 
expansion. ; : 

After the first partial expansion, all ng groups have been expanded and nextgroup 
is reset to 0, Each group now contains three regular buckets rather than two, and the . 
file size (in number of buckets) has grown by 50 percent. 
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During the second expansion, nb always equals nextgroup +3 * ng. (At the 
start of a second partial expansion, nextgroup is 0 and nb equals 3 * ng.) Three- 
` bucket groups are expanded to four by moving some records from buckets nextgroup, 
nextgroup + ng, and nextgroup +2* ng (and some of their associated overflow records) 
into bucket nextgroup +3 * ng (which equals nb during the second partial expansion). 
Any overflow records not moved to bucket nb during an expansion are moved back 
into.their home bucket, if there is room. 

After the second partial expansion, the file size has doubled and a full expansion 
has taken place. In preparation for the next full expansion, the number of groups (ng) 
. is doubled and the size of each groups is halved, from four to two. That is, group i, 

‘consisting of buckets i,i + j,i +2» J, and i + 3 * j (where j is the old value of ng), 
is now viewed as two separate groups: group / consisting of buckets i and i +2» j and 
group.i + j constisting of buckets i + j and i + 3 * j. The first partial expansion of the 
next full expansion then begins. 

Note that the group that is expanded is always the next sequential group and is in- 
dependent of whether or not overflow has taken place in that group. Overflow is handled 
by a separate mechanism from expansion. 

A key question is how a hash function is used to access a record directly. When 
the file contains nb buckets, the hash function must produce a value between O and 
nb — 1, but when the file size is increased by one bucket, it must produce:a value be- 
tween 0 and nb. Further. a record moved from bucket i to bucket j must have previously 
hashed into i and must henceforth hash into j. The hash function must also be used to 
determine whether or not a record should be moved during an expansion. The following 
method allows direct access to a record's bucket. Although it does involve many CPU 
operations, it only requires one I/O operation to obtain the appropriate bucket from a` 

“nonoverflow record. ` 

A function A1(key) that produces values in the range of 0 to m — 1 is used as a 
direct hashing function initially, before any expansions have taken place. The value 

h\(key) is called the initial hash of key. We also assume a function h2(key, i) for ' 
i > 0 that uniformly produces values in the range | to 4. The value of h2(key, i) de- 
termines: whether the record with key key is moved during the ith full expansion and, 
if so, whether it is moved to the first or the second expansion bucket of its group. If 
h2(key, i)-is 1 or 2, the record is not moved during the ith full expansion; if h2(key, i) 
is 3, it is moved to the first expansion bucket of its group in the ith full expansion; if 
h2(key, i) is 4, it is moved to the second expansion bucket. Thus a random record has a 
50 percent chance of being moved during a full expansion and a 25 percent chance of 
being moved to either of the two expansion buckets. 

During the first partial expansion of the ith full expansion, the hashing algorithm 
examines the values h2(key, i), h2(Key, i + 1), h2(key, i + 2), and so on, seeking the first 
value less than 4. If that value is | or 2, the record is not moved: if it is 3, itis moved 
to the single expansion bucket. It will Stay there after the second partial expansion 
if h(key; i) is 3; it will be moved to the second partial expansion bucket by the second 
partial expansion if A2(key, i) is 4. Thus a random record has a one-third chance of being 
moved to the third bucket of a group in a first partial expansion and a one-quarter chance 
of being moved to the fourth bucket in a second partial expansion. This guarantees that 
the records are distributed unif6rmly throughout the file. 
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Define the level of an LH2P file as the number of full expansions that have taken 
place, and let the variable./evel contain the level of the file. If pe is the number of the 
current partial expansion (either | or 2), pe + 1 is the number of buckets in a group that 
has not yet been expanded in the current partial expansion, and pe + 2 is the number 
of buckets in a group that has been expanded. At all times, 


ng =m t 2level-1 
and 
nb = nextgroup + (pe + 1) * ng; 


The values m, level, nextgroup, and pe therefore define the state of an LH2P file and 
. must be kept with the file at all times. 

To locate the address of a record, it is necessary to follow its relocations through 
the /evel full expansions, the pe — 1 completed partial expansions, and the nextgroup 
compleied simple expansions of the current partial expansion. The hash of a key, h(key), 
is therefore computed by the following algorithm: 


h = hi(key); /* initial hash */. 
numgroups = m/2;, /* initial number of groups */ 
for (i = 1; i < level; i++) { 
/* trace the movement of the record */ 
/* through level full expansions */ 
bucketnum = h2(key, 7); 
if (bucketnum > 2) { 
/* record was moved in ith full expansion */ 
groupnum = h % numgroups; 
h = groupnum + (bucketnum - 1) * numgroups; 
} /* end if */ 
numgroups = 2 * numgroups; /* number of groups after */ 
/* 7 full expansions */ 
} /* end for */ 
/* At this point h holds the key's hash after Tevel */ 
/* full expansions and no partial expansions. &/ 
/* Now, compute the record's current location after */ 
/* possible movement in the current full expansion. */ 


groupnum = h % numgroups; /* current group number */ 

i = level + 1; 

bucketnum = h2(key, 7); /* eventual bucket number */ 
/* at end of current full */ 


` Ph expansion be 
/* compute the size of the current group */ : 
/* pe holds the number of the current */ 
/* . partial expansion */ 
if (groupnum < nextgroup) 

groupsize = pe + 2; 


` 
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else 

groupsize = pe + 1; ! 
/* if the eventual bucket number is larger than */ 
/* Current group size, then continue hashing */ 
/* until the current bucket number is found a} 
while (bucketnum > groupsize) { 

i++} 

bucketnum = h2(key, 7); 
} /* end while */ 
if (bucketnum > 2) 

/* the record was already moved */ 

/* in the current full expansion */ 

h = groupnum + (bucketnum - 1) * numgroups ; 


The following example illustrates LH2P. Consider an LH2P file that initially con- 
tains six buckets, 0 through 5, consisting of three groups: (0, 3), (1, 4), and (2, 5). In 
this case, m is 6 and nb is initially 6. Consider six records ro through r5 with keys kO 
through kS that initially hash into 0 through 5, respectively (that is, hI (ki) = i), Then 
r0 through r5 are placed in buckets 0 through 5 initially. 


Assume that the following are the values of h2(key, i) for key equal to kO through 
k5 and i from 1 to 4: 


i 
Key A2(key, 1) A2(key, 2) h2(key, 3) 2 key, 4) 
a a er <i 


kO 1 4 2 3 
kl 4 2 1 2 
k2 3 1 2 1 
k3 4 4 3 1 
k4 1 2 1 $: 
k5 2 4 4 3 


. The following table illustrates the rearrangement.of records during expansion of 
this LH2P file. Each simple expansion is specified in the first column by a status triple 
consisting of the number of full expansions that have taken place (level), the number of 
the current partial expansion(pe), and the number of the group currently being expanded 
{nextgroup). Initially, there are three groups (ng = 3), six buckets (nb = 6), and two 
buckets per group. The second column shows the existing buckets of the current group 
in parentheses. followed by the bucket being added to the group in the current expansion 
Step. We use the notation bi to indicate bucket i. Below the buckets of the group in 
each second-¢glumn entry are the records contained in that group. The third column 
indicates the results of the simple expansion. Of course, for the expansions to take place. 
additional records must be added so that the load factor exceeds the threshold. However, 
we do not illustrate these other records here but merely illustrate how existing records 
move into expansion buckets. 
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Status Group and records Result of expansion 


(0,1,0) > (b0, b3): b6 h2(k0, 1) = 1, so r0 remains in b0. 
(r0. r3) h2(k3, 1) = 4;h2(r3,2) = 4; 

h2(k3, 3) = 3, so r3 moves to b6 
(0,1,1) (b1, b4); b7 h2(kl, t) = 4;h2(kl, 2) = 2, 


so rl remains in b1. 
h2(k4, 1) = 1, so r4 remains in b4. 
(0, 1, 2) (b2, b5), b8 h2(k2, 1) = 3, so r2 moves td b8. 
(42 r5) . h2Kk5, 1) = 2, so r5 remains in b5. 


This ends the first partial expansion. There are still three groups, but each now 
contains three buckets, so nb = 9. The second partial expansion then begins: 


Status Group and records Result of expansion 
(0, 1, 0) (b0, b3. b6); b9 h2(kO, 1) =_1, so rO remains in b0. 
(r0. r3) h2(k3, 1) = 4, so r3 moves to bY. 
(0,1,1) (b1, b4, b7),b10  h2(kl, 1) = 4, so ri moves to b10. 
(rl, r4) H2 kå: 1) = 1, so r4 remains in b4. 
(0, 1, 2) (b2, b5, b8); b11 h2(k2, 1) = 3, so r2 remains in h8. 
5 (r2, r5) h2(k5, 1) = 1, so r5 remains in b5. 


This ends the first full expansion. There are now three groups, and each contains 
four buckets, so nb = 12. To start the second full expansion, the number of groups is 


doubled (ng = 6), and the number of buckets in each is reset to 2. The second full 
expansion proceeds: , 


Status Group and records Result of expansion 

(i, 1,0) (b0. b6); b12 h2(kO, 2) = 4; h2(kO, 3) = 2. 
(0) so rO remains in 60. 

(1,1,1). (b1, b7, ), 613 No records in this group. 

(1, 1,2) (b2, b8); b14 h2(k2, 2) = 4, so r2 remains in b8. 
(r2) 

ick 3) (h4, b10); B15 N2(K3, 2) = 4; h2(43, 3) = 3, 
CC ne so r3 moves to b15: . 

A14) (b4, b10): 616 h2(k1,2) = 2, so rl remains in b10. 

bs (rl, r4) h2(k4, 2) = 2,80 r4 remai::s in b4. 

1,5) (b5. b11); b17 h2KS, 2) = 4:h2(KS, 3) = 4, 4245.4) = 3, 
(r5) so r5 moves to b17. 


This ends the first partial expansion. 


` 
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Status Group and records Result of expansion 


(1,2,0)  (60,66.612);618 h2(k0, 2) = 4, so r0 moves to b18. 
(r0) 

(1,2,1)  (61,07,613); b19 No records in this group. 

(1,2,2) (b2,b8.b14);b20  h2(k2,2) = 1, so rl remains in b8. 


(r2) - 

(1,2,3)  (63,b9,b15); b21 h2(k3, 2) = 4, so r3 moves to 621. 
(r3) : 

(1, 2,4)  (64,b10,b16);622 h2(ki.2) = 2, s0 rl remains in b10. 

ind (b1,b4) h2(k4, 2) = 2, so r4 remains in b4, 

(1,2,5) ` (b5,b11.b17);b23  h2(k5,2) = 4, so r5 moves to b23. 

(r5) z . 
This ends the second full expansion. 


The techniques of LH2P can be generalized to allow n partial expansions in each 
full expansion. Such a scheme is called LHnP. Each partial expansion increases the 
file size by the fraction 1/n. Although higher values of n reduce the average number 
of overflow records (since records hashing to a particular bucket are redistributed more 
frequently) and therefore the number of accesses for both search and insertion, more 
partial expansions require more frequent allocations of storage and more complex hash 
value computations. Thus, practical insertion costs and expansion costs are higher. The 
value n = 2, leaving to the scheme LH2P, is a practical compromise. 

Overflow can be handled by a variety of methods under linear hashing. Use of 
overflow buckets is the simplest technique but may require varying-sized buckets for 
efficiency. Such variation would complicate storage management. Ramamohanarao and 
Sacks—Davis suggest recursive linear hashing in Which records that overflow the prime 
area are placed in a second linear hashing file, records that overflow that area are placed 
in a third, and so on. More than three areas are rarely needed. 

There are several techniques that eliminate the need for separate, dedicated over- 
flow areas. Mullin suggests using chaining within the linear hashing file, in which the 
most recently expanded group is used to contain overflow records (since that group is 
most likely to have empty space). Larson suggests that every kth bucket in the primary 
area be reserved for overflow records, with the hash function suitably modified to avoid 
the overflow buckets. 

Larson also suggests the possibility of using linear rehashing to locate overflow 
records. When linear rehashing is used, it is more efficient to implement each partial 
expansion in several sweeps of step size s > | and to go backward among the groups 
ng — 1,ng — 1 — s, ng — 1.— 2*5, and so on; the second step would expand groups 
ng — 2,ng — 2 — s,ng—2—2*s, and so on. Linear rehashing can also be combined with 
the separator method of Gonnet and Larson to allow one-access retrieval and eliminate 
the overhead of overflow. 


` 


Choosing a Hash Function 


Let us now turn to the question of how to choose a good hash function. Clearly, 
the function should produce as few hash clashes as possible; that is, it should spread the 
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keys uniformly over the possible array indices. Of course, unless the keys are known 
in advance, it cannot be determined whether.a particular hash function disperses them 
properly. However, although it is rare to know the keys before selecting a hash function, 
it is fairly common to know some properties of the keys that affect their dispersal. 

In general, a hash function should depend on every single bit of the key, so that 
two keys that differ in only one bit or one group of bits (regardless of whether the group 
is at the beginning, end, or middle of the key or strewn throughout the key) hash into 
different values. Thus a hash function that simply extracts a portion of a key ‘is not 
suitable. Similarly, if two keys are simply digit or character permutations of each other 
(such as 139 and 319 or meal and lame), they should also hash into different values. 
The reason for this is that key sets frequently have clusters or permutations that might 
otherwise result in collisions. 

For example, the most common hash function (which we haye used in the exam- 
ples of this section) uses the division method, in which an integer key is divided by 
the table size and the remainder is taken as the hash value. This is the hash function 
h(key) = key % tablesize. Suppose, however, that tablesize equals 1000 and that all the 
keys end in the same three digits (for example, the last three digits of a part number 
might represent a plant number, and the program is being written for that plant). Then 
the remainder on dividing by 1000 yields the same value for all the keys, so that a hash 
clash occurs for each record except the first. Clearly, given such a collection of keys, a 
different hash function should be-used. 

> It has been found that the best results with the division method are achieved when 
tablesize is prime (that is, it is not divisible by any positive integer other than 1 and 
itself). However, even if tablesize is prime, an additional restriction is called for. If r 
is the number of possible character codes on a particular computer (assuming an 8- 
bit byte, r is 256), and if tablesize is a prime such that r % tablesize equals 1, the 
hash function key % tablesize is simply the sum of the binary representation of the 
characters in the key modulo tablesize. For example, suppose that r equals 256 and 
that tablesize equals 17, in which case r % tablesize = 1. Then the key 37956, which 
equals 148 » 256 + 68 (so that the first byte of its representation is 148 and the second 
byte is 68), hashes into 37956 % 17, which equals 12, which equals (148 + 68) % 17. 
Thus two keys that are simply character permutations (such as steam and mates) will 
hash into the same value. This may promote collisions and should be avoided. Similar 
problems occur if tablesize is chosen so that r4 % tablesize is very small or very close 
to tablesize for some small value of k. 

Another hash method is the multiplicative method. In this method a real number 
c between 0 and 1 is selected. h(key) is defined as floor(m*frac(c*key)), where the 
function floor(x), available in the standard library math.h, yields the integer part of the 
real number x, and frac(x) yields the fractional part. [Note that frac(x) = x — floor(x).] 
That is, multiply the key by a real number between 0 and 1, take the fractional part of 
the product yielding a random number between 0 and 1 dependent on every bit of the 
key, and multiply by m to yield an index between 0 and m — ' If the word size of the 
computer is b bits, c should be chosen so that 2” « c is an integer relatively prime to 
2, and-c should not be too close to either 0 or 1. Also if r, as before, is the number of ` 
possible character codes, avoid values of c such that frac((r*) * c) is too close to 0 or 
! for some small value of k (these values yield similar hashes for keys with the same last 
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k characters) and of values c of the form i/(r — 1) or i/(r? — 1) (these values yield similar 

hashes for keys that are character permutations). Values of c that yield good theoretical 
properties are 0.6180339887 [which equals (sgrt(5) — 1)/2] or 0.3819660113 [which 

` equals 1 — (sqrt(5) — 1)/2). If mis chosen as a power of 2 such as 2”, the computation 
of h(key) can be done quite efficiently by multiplying the one-word integer key by the 
one-word integer c * 2° to yield a two-word product. The integer represented by the’ 
Most significant p bits of the integer in the second word of this product is then used as 
the value of h(key). ; 

In another hash function, known as the midsquare method, the key is multiplied 
by itself and the middle few digits (the exact number depends on the number of digits 
allowed in the index) of the square are used as the index. If the square is considered 
as a decimal number, the table size must bea power of 10, whereas if it is considered 
as a binary number, the table size must be a power of 2. Alternatively, the number 
represented by the middle digits can be divided by the table size and the remainder 
used as the hash value. Unfortunately, the midsquare method does not yield uniform 
hash values and does not perform as well as the previous two techniques. 

The folding method breaks up a key into several segments that are added or ex- 
clusive ored together to form a hash value. For example, suppose that the internal.bit 
‘String representation of a key is 010111001010110 and that 5 bits are allowed in the 
index. The three bit strings 01011, 10010, and 10110 are exclusive ored to produce 
01111, which is 15 as a binary integer. (The exclusive or of two bits is 1 if the two bits 
are different, and 0 if they are the same. It is the same as the binary sum of the bits, 
ignoring the carry.) The disadvantage of the folding method is that two keys that are 
k-bit permutations of each other (that is, where both keys consist of the same groups of 
k bits in a different order) hash into the same k-bit value. Still another technique is to 
apply a multiplicative hash function to each segment individually before folding. 

There are many other hash functions, each with its own advantages and disadvan- 
tages depending on the set of keys to be hashed. One consideration in choosing a hash 
function is efficiency of calculation; it does no good to be able to find an object on the 
first try if that try takes longer than several tries in an alternative method. 

If the keys are not integers, they must be converted into integers before applying 
one of the foregoing hash functions. There are several ways to do this. For example, for 
a character string the internal bit representation of each character can be interpreted as a 
binary number. One disadvantage of this is that the bit representations of all the letters or 
digits tend to be very similar on most computers. If the keys consist of letters alone, the 
index of each letter in the alphabet can be used to create an integer. Thus the first letter 
of the alphabet (a) is represented by the digits 01 and the fourteenth (n) is represented by 
the digits 14. The key ‘hello’ is represented by the integer 0805121215. Once an integer 
representation of a character string exists, the folding method cañ be used to reduce it to 
manageable size. However, here too, every other digit is a 0, 1, or 2, which may result 
in nonuniform hashes. Another possibility is to view each letter as a digit in base-26 
notation so that ‘hello’ is viewed as the integer 8*26* + 5*263 + 12*267 + 12*26+ 15. 

One of the drawbacks of all these hash functions is that they are not order pre- 
serving; that is, the hash values of the two keys are not necessarily in the same order as 
the keys themselves. It is therefore not possible to traverse the hash table in sequential 
order by key. An example of a hash function that is order preserving is h(key) = kev/c, 
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where c is some constant chosen so that the highest possible key divided by c equals 
tablesize — 1. Unfortunately, order-preserving hash functions usually are’ severely 
nonuniform, leading to many hash clashes and a larger average number of probes to 
access an element. Note also that to enable sequential access to keys, the separate 
chaining method of resolving collisions must be used. 


Perfect Hash Functions 


Given a set of keys K = {k1,k2,... kn}, a perfect hash function is a hash function 
h such that A(ki) != A(kj) for all distinct i and j. That- is, no hash clashes occur under 
a perfect hash function. In general, it is difficult to find a perfect hash function for a 
particular set of keys. Further, once a few more keys are added to the set for which a 
perfect hash function has been found, the hash function generally ceases to be perfect 
for the expanded set. Thus, although it is desirable to find a perfect hash function to 
ensure immediate retrieval, it is not practical to do so unless the set of keys is static and 
is frequently searched. The most obvious example of such a situation is a compiler in 
which the set of reserved words of the rogramming language being compiled does not 
change and must be accessed repeatedly. In such a situation, the effort required to find 
a perfect hashing function is worthwhile because, once the function is determined, it 
can Save a great deal of time in repeated applications. 

Of course, the larger the hash table, the easier it is to find a perfect hash function 
for a given set of keys. If 10 keys must be placed in a table of 100 elements, 63 percent of 
the possible hash functions are perfect ( although as soon as the number of keys reaches 
13 in a 100-item table, the majority are no longer perfect). In the example given earlier, 
if the compiler symbol table is to contain all symbols used in any program so that a 
large table must be allocated to allow for a large number of user-declared identifiers, a 
perfect hash function can easily be found forthe reserved symbols of the language. The 
table can be initialized with the reserved symbols already in the positions determined by 
that function, with the user-defined symbols inserted as they are encountered. Although 
hash clashes may occur for user symbols, we are guaranteed immediaté lookup for the 
reserved symbols. : i 

In general it is desirable to have a perfect hash function for a set of n keys in a 
table of only n positions. Such a perfect hash function is called minimal. In practice 
this is difficult to achieve. Sprugnoli has developed a number of perfect hash function 
determination algorithms. The algorithms are fairly complex and are not presented here. 
One technique finds perfect hash functions of the form h(key) = (key + s)/d for some 
integers s and d. These are called quotient reduction perfect hash functions, and, once 
found, are quite easy to compute. 

For the key set 17, 138, 173, 294, 306, 472, 540, 551, and 618, Sprugnoli’s algo- 
rithm finds the quotient reduction hash function (key + 25)/64, which yields the hash 
values 0, 2, 3. 4, 5. 7, 8, 9,,and 10. The function is not minimal, since it distributes the 
9 keys to a table of 11 positions. Sprugnoli’s algorithm does, ho» ~ver, find the quotient 
reduction perfect hash function with the smallest table size. 


An improvement to the algorithm yields a minimal perfect hash function of the 
‘form : 
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h(key) = (key + s)/d Gf key <et 
h(key) = (key + s + r)/d if key > t 


where the values s, d, t, and r are determined by the algorithm. However, the algorithm 
to discover such a minimal perfect hash function is O(n*) with a large constant'of pro- 
portionality so that it is not practical for even very small key sets. A slight modification 
yields a more efficient algorithm that produces a near-minimal perfect hashing function 
of this form for small key sets. In the foregoing example, such a function is 


h(key) = (key - 7)/72 if kev <= 306 
h(key) = (key - 42)/72 if key > 306 


which yields the hash values 0, 1, 2, 3, 4, 5, 6, 7, and 8 and happens to be minimal. A 
major advantage of quotient reduction hash functions and their variants is that they are 
order preserving. 

> Sprugnoli also presents another group of hashing functions, called remainder re- 
duction perfect hash functions, which are of the form 


a(key) = ((r + s * key) % x)/d 


and an algorithm to produce values r, s, x, and d that yield such a perfect hash function 
for a given key set and a desired minimum load factor. If the minimum load factor 
is set to 1, a minimal perfect hash function results. However, the algorithm does not 
guarantee that a perfect remainder reduction hash function can be found in reasonable 
time for high load factors. Nevertheless, the algorithm can often be used to find minimal 
perfect hash functions for small. key sets in reasonable time. 

S Unfortunately, Sprugnoli’s algorithms are all at least O(n") and are therefore only 
practical for small sets of keys (12 or fewer). Given a larger set of keys, k, a perfect 
hash function can be developed by a technique called segmentation. This technique 
involves dividing k into a number of small sets, ko,k2, ... „Kp, and finding a perfect hash 
function h; for each small set k;. Assume a grouping function ser such that key is in 
the set ksenzey). If m; is the maximum value of h; on k; and-b; is defined as i + mo+ 
mı +--+: + m;-,, we can define the segmented hash function h as h(key) = bsenkey) + 
Rserkey)(key). Of course, the function set that determines the grouping must be chosen 
with care to disperse the keys reasonably. 

Jaeschke presents a method for generating minimal perfect hash functions us- 
ing a technique called reciprocal hashing. The reciprocal hash functions generated by 
Jaeschke’s algorithm are of the form 


h(key) = (c/(d * key + e)) % tablesize 
for some constants c, d, and e, and tablesize equal to the number of keys. Indeed, if the 
keys are all relatively prime integers, a constant c can be found that yields a minimal 


perfect hash function of the form 


(c/key) % tablesize 
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by the following algorithm. Assume that the keys are initially in a sorted array k(0) 
through k(n — 1) and that f(c, key) is the function (clkey) % n. i 


C= ((n - 2) * k(0) * k(n - 1))/(k(n ~ 1) - k(0)); 
while (TRUE) { - 
/* check if c'yields a perfect hash function */ i 
bigi = -1; /* these will be set to the largest values */ 
big} = -1; /* such that f(c, k(bigi)) = f(c, k(bigj)) */ 
for (i = 0; i< n; i++) 
val(i) = f(c, kCi)); 
for (i = n - 1; bigi < 0 & i >= 0; i--) { 
vi = val(i); ; 
jais i 
while (bigi < 0 & j >= 0) 
if (vi == val(j)) { 
bigi = i; 
big} = j; 


else 


} /* end for */ 
if (bigi < 0) 
return; 

/* increment c */ 

x = k(bigj) - (c % k(bigj)); 

y = k(bigi) - (c % k(bigi)); 

(x<y) ? C+ XI CH Y; 
} /* end while */ 


- Applying this algorithm to the key set 3, 5, 11, 14 yields c = 11 and the minimal 
perfect hash function (11/key) % 4. For the key set 3, 5, 11, 13, 14 the algorithm pro- 
duces c = 66. In practice one would set an upper limit on the value of c to ensure that 
the algorithm does not go on indefinitely. i 


If the keys are not relatively prime, Jaeschke presents another algorithm to com- 
pute values d and è so that the values of d * k(i) + e are relatively prime, so that the 
algorithm can be used on those values. 

For low values of n, approximately 1.82" values of c are examined by this algo- 
rithm, which is tolerable for n <= 20. For values of n up to 40, we can divide the keys 
into two sets s1 and s2 of size n1 and n2, where all keys in s1 are smaller than those of 
s2. Then we can find values cl, dl, el and c2, d2, e2 for each of the sets individually 


and use 
h(key) = (c1/(dì * key + el)) % nl 
for keys in sl and 


h(key) = nl 7 ENE. * key + e2)) % m 


for keys in s2. For larger key sets, the segmentation technique of Sprugnoli can be used. 
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Chang presents an order-preserving minimal perfect hash function that depends 
on the existence of a prime number function, p(key), for the set of keys. Such a function 
always produces a prime number corresponding to a given key and has the additional 
Property that if key] is less than key2, p(key1) is less than p(key2). An example of such 
a prime number function is 

p(x) = x? - x4 41 for 1 <= x <= 40 


If such a prime number function has been found, Chang presents an efficient algorithm 
to produce a value c such that the function A(key) = c % p(key) is an order-preserving 
minimal hash function. However, prime number functions are difficult to find, and the 
value c is too large to be practically useful. F Ah 

Cichelli presents a very simple method that often produces a minimal or near- 


minimal perfect hash function for a set of character strings. The hash function produced 
is of the form 


h(key) = val(key[0]) + val(key[length(key) - 1}) + Tength(key) 


where val(c) is an integer value associated with the character c and key[i] is the ith char- 
acter of key. That is, add integer values associated with the first and last characters of 
the key to the length of the key. The integer values associated with particular characters 
are determined in two steps as follows. 

The first step is to order the keys so that the sum of the occurrence frequencies 
of the first and last characters of the keys are in decreasing order. Thus if e occurs 
ten times as a last or first character, g occurs six times, £ occurs nine times, and o oc- 
curs four times, the keys gate, goat, and ego have occurrence frequencies 16 (6+ 10), 
15 (6 + 9), and 14 (10 + 4), respectively, and are therefore ordered properly. 

` Once the keys have been ordered, attempt to assign integer values. Each key is ex- 
amined in turn. If the key’s first or last character has not been assigned values, attempt 
to assign one or two values between 0 and some predetermined limit. If appropriate 
values can be assigned to produce a hash value that does not clash with the hash value 
of a previous key, tentatively assign those values. If not, or if both characters have been 
assigned values that result in a conflicting hash value, backtrack to modify tentative 
assignments made for a previous key. To find a minimal perfect hash function, the pre- 
determined limit for each character is set to the number of distinct first and last character 
occurrences. 

Cichelli perfect hash functions may not exist for some key sets. For example, 
if two keys of the same length have the same or reversed first and last characters, no 
such hash function can exist. In that case, different character positions may be used to 
develop the hash function. However, in other cases no such hash function can be found 
regardless of what character positions are used. In practice it is often useful to attempt to 
find a Cichelli perfect hash function before trying other methods. If the predetermined 
limit is set high enough, so that minimality is not required, Cichelli’s algorithm can be 
quite practical for up to 50 keys. Cook and Oldehoeft present several improvements on 
the basic Cichelli method. 

Sager presents an important generalization and extension of Cichelli‘s method 
that efficiently finds perfect hash functions for as many as 512 keys. The method is 


Sec. 7.4 Hashing 511 


fairly complex and is not presented here; the interested reader is referred to Sager’s 
paper listed in the Bibliography. 

An additional technique for generating minimal perfect hash functions is at- 
tributable to Du, Hsieh, Jea, and Shieh. The technique uses a number of nonperfect 
random hash functions h4, ..., hj and a separate hash indicator table (or hit) of size 
n. The table is initialized as follows. First, set all its entries to 0. Next, apply h, to 
all the keys. For all values x between 0 and n — 1 such that only one key hashes to x 
using h1, reset hit[x] from 0 to 1. Remove all keys that hash to unique values using hy 
from the key set and apply hz to the remaining keys. For all values x between 0 and 
n — l such that hit(x] = O and only one key hashes to x using Az, reset hit{x] from 
O to 2. This process continues until either the key set is empty (in which case hit has 
been initialized and any remaining unused hash functions are unnecessary) or until all 
the hash functions have been applied (in which case, if there are remaining keys, a 
perfect hash tunetion cannot be found using this method and the given random hash 
functions). z 

Once hit has been fully initialized, the hashing algorithm is as follows 

for (i = 0; ; i++) {. 

x = hi(key); 
if (hit(x) == i) 
return(x); 

} /* end for */ 


The probability that a perfect hash function results rises very slowly as additional ran- 
dom hash functions are added. Therefore a segmentation technique, with distinct. a 
tablės, should be used for large sets of keys. 


Universal Classes of Hash Functions 


As we have seen, it is difficult to obtain a perfect hash function for a large set 
of keys. It is also not possible to guarantee that a specific hash function minimizes 
collisions without knowing the precise set of keys to be hashed. If a particular hash 
function is found not to work well in practice in a particular application, it is difficult 
to come up with another hash function that does better. 

* Carter and Wegman have introduced the concept of a universal class of hash 
functions. Such a class consists of a set of hash functions hi(key). Although an individ- 
ual function in the class may work poorly on a particular input key set, enough of the 
functions work well for any random input set that if one function is chosen randomly 
from the class, it is likely to perform well on any input set that is actually presented. 

Given a hash table of size m, and a set a of possible `2ys, a class of nh hash 
functions h is universal, if there are no two keys in a on which more than nh/m of the 
functions in A result in collision. This means that no pair of distinct keys clash under 
more than I/mth of the functions. It can be shown that if k items have been inserted into 
a hash table of size m using a random member of a universal, class of hash functions 
with separate chaining, the expected number of probes for an unsuccessful search is 
less than 1 + k/m (the number for a successful search is even lower). 
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Carter and Wegman present several examples of such universal} classes. One 
_ example of such a class is for keys that can be represented as Positive integers between 
O and w — "I (w — 1 is usually the maximum value that fits into one computer word). 
Let p be a prime number larger than w, let s be an integer between | and p — 1, and let 
t be an integer between 0 and p — 1. Then define hy (key) as ((s*key + t) % p) % m. 
The set of all such functions h,, for given w and p is universal. 

A second example is for keys consisting of i bits and a table size m = 2/ for 
some j. Let a be an i-element array of table indexes (between 0 and m — 1). Then 
define h,.(kev) as the exclusive or of the indexes afk] such that the kth bit of Key is 
l. For example, if m = 128, i = 16. a is an array containing the values 47, 91, 35, 
42, 16. 81. 113, 91, 12, 6. 47, 31, 106. 87, 95. and T1, and key is 15381 (which is 
001111000001010] as a 16-bit number). h, key) is the exclusive or of a[3}, alá), al}, 
a6). a] 12]. af 14], and al 16] (these are 35, 42. 16,47, 31.87 and 11 ). which is 0}11010] 
or 117. The set of functions h, for all such array values a is universal. 

If the hash table is maintained internally isd is not required between program 
runs (as in a compiler, for example). the hash function used may be generated by the 
program from a universals class to guarantee reasonable average running time (although 
any particular run may bz slow). In the preceding examples a random number generator 
might be used to select s. £, and the elements of a. If the hash table remains between 
program runs. as in a file or data base, then a random hash function from the universal; 
class might be selected initially. and i? poor program behavior is observed (although 
this is unlikely). a new random function could be selected and the entire hash table 
reorganized at a convenient time. 

Sarwate has introduced an even better category of hash functions classes. called 
optimally universal: (OU2) classes. If there are nk possible keys and m table entries, 
a set H containing nh hash functions is OU; if any two keys collide under exactly 
nh» (nk — m)(m* (nk ~ 1)) functions in OU; and if, for any function A in H, every key 
collides with exactly nk/m — 1 other keys. Sarwate provides several examples of such 
OU, classes. Unfortunately, hash functions in OU; classes are difficult to compute and 
may not be practically useful. 


EXERCISES 


7.4.1. Write a C function search(table, key) that searches a hash table for a record with key 
key. The function accepts an integer key and a tabie declared by 


struct record { 
KEYTYPE k; 
RECTYPE r; 
*-t flag; 

} array{TABLESIZE]; 


table|i|.& and tubleļi}.r are the ith key and record. respectively. tuble|i} fag equals 
FALSE if the ith table position is empty and FRUE if itis occupied. The routine returns 
an integer in the range O to rablesize - lifa record with key Kev is present in the table 
If no such record exists. the function returns - 1. Assume the existence of a hashing 
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7.4.2. 


7.4.3. 


7.4.4. 


7.4.5. 


7.4.6. 


7.4.7. 


7.4.8. 


routine, A(key), and a rehashing routine rh(index) that both produce integers in the 
range 0 to tablesize — 1. 


Write a C function sinsert(table, key, rec) to search and insert into a hash table as in 
Exercise 7.4.1. ` 


Develop a mechanism for detecting when all possible rehash positions of a given key 
have been searched. Incorporate this method into the C routines search and sinsert of 
the previous exercises. 


_ Consider a double hashing method using primary hash function h1(key) and rehash 


function rh(i) = tablesize % (i + h2(key), tablesize). Assume that h2(key) is relatively 
prime to tablesize, for any key key. Develop a search algorithm and an algorithm to 
insert a record whose key is known not to exist in the table so that the keys at sucessive 
rehashes of a single key are in ascending order. The insertion algorithm may rearrange 
records previously inserted into the table. Can you extend these algorithms to a search 
and insertion algorithm? 


Suppose that a key is equally likely to be any integer between a and b. Suppose the 
midsquare hash method is used to produce an integer between 0 and 24-!. Is the 


. Fesult equally likely to be any integer within that range? Why? 


Given a hash function h(key), write a C simulation program to determine each of the 

following quantities after 0.8 * tablesize random keys have been generated. The keys 

should be.tandom integers. 

1. the percentage of integers between 0:and tablesize — 1 that do not equal A(key) 

` for some generated key 

2. the percentage of integers between 0 and tablesize — 1 that equal h(key) for more 
than one generated key 

3. the maximum number of keys that hash into a single value between 0 and 
tablesize — 1 j 

4. the average number of keys that hash into values between 0 and tablesize — 1, 
not including those values into which no key hashes 

Run the program to test the uniformity of each of the following hash functions. 

(a) h(key) = key % tablesize for tablesize a prime 

(b)  h(key) = key % tablesize for tablesize a power of 2 

(c) The folding method using exclusive or to produce five-bit indices, where 

tablesize = 32 


. (d) The mid-square method using decimal arithmetic to produce four-digit indexes, 


where tablesize = 10,000 


If ahash table contains tablesize positions, and n records currently occupy the table, the 
load factor is defined as n/tablesize. Show that if a hash function uniformly distributes 
keys over the tablesize positions of the table and if If is the load factor of the table, 
(n — 1)*/f?2 of the n keys in the table collided upon insertion with a previously entered 
key. ; 

Assume that n random positions of a tablesize-element hash table are occupied, us- 
ing hash and rehash functions that are equally likely to produce any index in the ta- 
ble. Show that the average number of comparisons needed to insert a new element is 
(tablesize + 1)/(tablesize — n + 1). Explain why linear probing does not satisfy this 
condition. = = 
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-Graphs and Their 


_ Applications. 


In this chapter we consider a new data structure: the graph. We define some of the terms 


associated with graphs and show how to implement them in C. We also present several 
applications of graphs. - 


8.1 GRAPHS 


A graph consists of a set of nodes (or vertices) and a set of arcs (or edges). Each arc ina 
graph is specified by a pair of nodes. Figure 8.1. la illustrates a graph. The set of nodes is 
{A,B,C,D,E,F.G,H}, and the set of arcs is {(A,B),(A,D),(A,C),(C,D),(C, F),(E,G),(A,A)}. 


` represents the first node in the pair. The sét of arcs for the graph of Figure 8.1.1b 
is {<A,B>,<A,C>,<A,D>, <C,D>,<EC>,<E.G>,<A,A>}. We use parentheses to 
indicate an unordered pair and angled brackets to indicate an ordered pair. In the first 
three sections of this chapter, we restrict our attention to digraphs. We consider undi- 
rected graphs again in Section 8.4. 
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Note that a graph need not be a tree (Figure 8.1.1a, b, and d) but that a tree must 
be a graph (Figure 8.1.1c). Note also that a node need not have any arcs associated with 
it (node H in Figure 8.}.1a and b). 

A node n is incident to an arc x if n is one of the two nodes in:the ordered pair of 
nodes that constitute x. (We also say that x is incident to n.) The degree of a node is the. 
number of arcs incident to it. The indegree of a node-n is the number of arcs that have 
n as the head, and the outdegree of n is the number of arcs that have n as the tail. For 
example, node A in Figure 8.1.1d has indegree 1, outdegree 2, and degree 3. A node n 
is adjacent to a node m if there is an arc from m to n. If n is adjacent to m,n is called a 
successor of m, and m a predecessor of n. 

A relation R on a set A is a set of ordered pairs of elements of A. For example. if 
A = {3,5.6,8,10,17}. the set R = {<3.10>,<5.6>,<5.8>.<6.17>.<8,17>. <10.17>} 
is a relation. If <x, y> is a member of a relation R, x is said to be related to y in R. The 
above relation R may be described by saying that x is related to v if x is less than y and the 
remainder obtained by dividing v by x is odd. <8,17> is a member of this relation. since 
8 is smaller than 17 and the remainder on dividing 17 by 8 is 1. which is odd. 

A relation may be represented by a graph in which the nodes represent the un- 
derlying set and the arcs represent the ordered pairs of the relation. Figure 8.1.2a il- 
Justrates the graph representing the foregoing relation. A nuinber may be associated 
with each arc cf a graph as in Figure 8.1.2b. In that figure, the number associated with 
each arc is the remainder obtained by dividing the integer at the head of the arc by 
the integer at the tail. Such a graph. in which a number is associated with each arc. is 
called a weighted graph or a network. The number associated with an arc is culled its 
weight, ; 

We identify several primitive operations that are useful in dealing with graphs. 
The operation join(a,b) adds an arc from node a to node b if one does not already exist. 
Joinwit(a,b.x) adds an arc from a to b with weight x in a weighted graph. remv(a.b) 
and remyvwt(a.b,.x) remove an arc from a to b if one exists (remvwt also sets x to its 
weight). Although we may also want to add or delete nodes from a graph. we postpone 
a discussion of these possibilities until a later section. The function udjacent(a,b) returns 
true if b is adjacent to a, and false otherwise. 

A path of length k from node a to node b is defined as a sequence of k + | nodes 
MyM, 22. Nge; Such that my = a, nga; = b and adjacent(nj. nizi) is true for all i 
between | and k. If for some integer k, a path of length k exists between a und b. there 
is a path from a to b. A path from a node to itself is called a cycle. lia graph contains a 
cycle, it is cyclic; otherwise it is acyclic. A directed acyclic graph is called a dag from 
its acronym. 

Consider the graph of Figure 8.1.3. There is a path of length 1 from A to C. two 
paths of length 2 from B to G, and a path of length 3 from A to F There is no path from 
B to C. There are eycles from B to B. from F to F, and from H to H. Be sure that you 
can find all paths of length less than 9 and al! cycles in the figure. 


Application of Graphs 


We now consider an example. Assume one inpuf line containing four integers 
followed by any number of input lines with two integers each. The first integer on the 
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(b) 


Figure 8.1.2 : Relations and graphs. 


first line, n, represents a number of cities, which for simplicity are numbered from 0 to 
n — 1. The second and third integers on that line are between 0 and n — 1 and represent 
two cities. It is desired to travel from the first City to the second using exactly nr roads, 
where nr is the fourth integer on the first input line. Each subsequent input line contains 
two integers representing two cities, indicating that there is a road from the first city to 
the second. The problem is to determine whether there is a path of required length by 
which one can travel from the first of the given cities to the second. . . 

A plan for solution is the following: Create a graph with the cities as nodes and 
the roads as arcs. To find a path of length nr from node A to node B, look for a node C 
such that an arc exists from A to C and a path of length nr —.1 exists from C to B. If. 
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Figure 8.1.3 


these conditions are satisfied for some node C, the desired path exists. If the conditions 
are not satisfied for any node C; the desired path does not exist. The algorithm uses an 
auxiliary recursive function findpath(k,a,b), whose algorithm we also present shortly. 
This function returns true if there is a path of length k from A to B and false otherwise. 
` The algorithms for the program and the function follow: 


scanf (“%d",:&n); /* number of cities */ 


create n nodes and label them from 0 to n - 1; 

scanf ("%d %d", &a, &b); /* seek path from ato b */ 

scanf ("%d", &nr); /* desired number of */ 
j /* roads to take */ 


while(scanf("%d %d", &cityl, &city2) != EOF) 
join(cityl, city2); 

if (findpath(nr,a,b)) 
printf ("a path exists from %d.to %d in %d steps", 


= a, b, nr); 
else 
printf("no path exists from %d to %d in %d steps", 
a, b, nr); 


‘ 


The algorithm for the function findpath(k,a,b) follows: 


if (k == 1) oe 
/* search for a path of length 1 */ 
return (adjacent(a,b)); 

/* determine if there is a path through c_ */ 


Sec. 8.1 Graphs ` i : 519 


for (c = 0; c < n; ++c) 
if (adjacent(a,c) && findpath(k - 1, c, b)) 
return (TRUE); 
return (FALSE); /* assume no path exists */ 


Although the foregoing algorithm is a solution to the problem, it has several defi- 
ciencies. Many paths are investigated several times during the recursive process. Also, 
although the algorithm must actually check each possible path, the final result merely 
ascertains whether a desired path exists: it does not produce the path itself, More likely 
than not, it is desirable to find the arcs of the path in addition to knowing whether or not 
a path exists. Finally, the algorithm does not test for the existence of a path regardless of 
length; it only tests for a path of specific length. We explore solutions to some of these 
problems later in this chapter and in the exercises. 


C Representation of Graphs 


Let us now turn iv the question of representing graphs in C. Suppose that the 
number of nodes in the graph is constant: that is, arcs may be added or deleted but 
nodes may not. A graph with 50 nodes could then be declared as follows: 


#define MAXNODES 50 


struct node { 
/* information associated with each node */ 
}; 
struct arc { 
int adj; 
/* information associated with each arc */ 
}; 
struct graph { 
struct node nodes [MAXNODES]; 
struct arc arcs{MAXNODES] [MAXNODES] ; 
E 
struct graph g; 


Each node of the graph is represented by an integer between 0 and 
MAXNODES — 1, and the array field nodes represents the appropriate information as- 
signed to each node. The array field arcs is a two-dimensional array representing every 
possible ordered pair of nodes. The value of g.ares{i]lj J.adj is either TRUE or FALSE 
depending on whether or not node j is adjacent to node i. The two-dimensional array 
g.ares{ [ }.adj is called an adjacency matrix. In the case of a wes, Wed graph. each arc 
can also be assigned information. 

Frequently the nodes of a graph are numbered from 0 to MAXNODES — 1 and 
no information is assigned to them. Also, we may be interested in the existence of ares 
but not in any weights or other information about them. In such cases the graph could 
be declared simply by 


int adj [MAXNODES] {MAXNODES] ; 
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In effect, the graph-is totally described by its adjacency matrix. We present the code 
for the primitive operations just described in the case where a graph is described by its 
adjacency matrix. 


void join (int adj{][MAXNODES), int nodel, int node2) 
{ 
/* add an arc from nodel to node? */ 
_  adj[nodel][node2] = TRUE; 
} /* end join */ 
void remv(int adj[][MAXNODES], int nodel, int node2) 
{ 
/* delete arc from nodel to node2 if one exists ai 
adj(nodel}{node2] = FALSE; 
} /* end remy */ 
int adjacent(int adj[][MAXNODES], int nodel, int node?) 
i 


return((adj[node1] [node2] == TRUE)? TRUE: FALSE); 
} /* end adjacent */ 


A weighted graph with a fixed number of nodes may be declared by 
Struct arc { 

int adj; 

int weight; 


struct arc g[MAXNODES] [MAXNODES] ; 


The routine joinwi, which adds an are from node! to node? with a given weight wr, 
may be coded as follows: 


void joinwt (struct arc g[](MAXNODES], int nodel, int node2, int wt) 
{ 
g[node1] ({node2].adj = TRUE; 


g[node1] [node2] .weight = wt; 
} /* end joinwt */ 


The routine remvwt is left to the reader as an exercise. 


Transitive Closure 


Let us assume that a graph is completely described by its adjacency matrix, adj 
(that is, no data is associated with the nodes and the graph is not weighted). Consider 
the logical expression adjfi][k] && adj\k | j |. lts value is TRUE if and only if the values 
of both adj[i|{k] and adj|A|Uj-are TRUE, which implies that there is an arc from node 
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i to node k and an arc from node k to node J. Thus adj[i)[k} && adj[k][j] equals TRUE ` 
if and only if there is a path of length 2 from i to J passing through k. 
Now consider the expression i i 


Cadj{i}[0] && adj{O}(J)) || Cadjli) [1] && adj(1) (71) 
1...1] Cadjli] [MAXNODES - 1] && adj[MAXNODES -"1}[j]) 


The value of this expression is TRUE only if there:is a path of length 2 from node i to 
node j either through nade 0 or through node 1, ... or through. node MAXNODES — 1. 
This is the same as saying that the expression evaluates to TRUE if and only if there is 
some path of length 2 from node i to node j. i 

Consider an array adjz such that adj2[i][j ] is the value of the foregoing expres- 
sion. adj, is called the path matrix of length 2: adj2{i}[j] indicates whether or not 
there is a path of length 2 between i and j. (If you are familiar with matrix multi- 
plication, you should realize that adj is the product of adj with itself, with numeri- 
cal multiplication replaced by conjunction (the && operation) and addition replaced 
by disjunction (the || operation).) adj. is said to be the Boolean product of adj with 
itself. . i 

Figure 8.1.4 Badija Figure 8.1.4a depicts a graph and its adja- 
.cency matrix in which true is represented by 1 and false is represented by 0. Figure 
-8.1.4b is the Boolean product of that matrix with itself, and thus is the path matrix of 
length 2 for the graph. Convince yoursélf that a 1 appears in row i, column j of the 
matrix of Figure 8.1.4b if and only if there is a path of length 2 from node i to node j in 
the graph. f 

Similarly, define adj3, the path matrix of length three, as the Boolean product of 

adj, with adj. adj;{i)[ j ] equals true if and only if there is a path of length 3 from i to j. 
In general, to compute the path matrix of length Z, form the Boolean product of the path 
matrix of length / — 1 with the adjacency matrix. Figure 8.1.5 illustrates the matrices 
adj, and adj, of the graph in Figure 8.1.4a.- 


(b) adj, Figure 8.1.4 
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(a) adj; . (b) adjs | Figure 8.1.5 


Assume that we want to know whether a path of length 3 or less exists between 
two nodes of a graph. If such a path exists between nodes i and j, it must be of length 
1, 2, or 3. If there is a path of length 3 or less between nodes i and j the value of 


ajli IL adja LÙ 11 adj3 UL 


must be true. Figure 8.1.6 shows the matrix formed by “or-ing” the matrices adj, adjz, 
and adj3. This matrix contains the value TRUE (represented by the value 1 in the fi gure) 
in row i, column j, if and only if there is a path of length 3 or less from node i to 
node j. 

. Suppose that we wish to construct a matrix path such that path{i)[j] is TRUE if 
and only if there is some path from node i to node j (of any length). Clearly, j 


path[i][j) == adjli)(j) 1! adj2 [il] ||... 


However, the preceding equation cannot be used in computing path, since the process 


that it describes is an infinite one. However, if the graph has n nodes, it must be true 
that — ; 


pathtil Lj) == adj[i)[j} || adj2 [i] [j] Aá li adj, [iL] 


This is because if there is a path of length m > n from i to j such as i, fos SEE T A 
there must be another path from i to j of length less than or equal to n. To see this, note 
that since there are only n nodes in the graph, at least one node k-must appear in the path 
twice. The path from i to j can be shortened by removing the cycle from k to k. This 
Process is repeated until no two nodes in the path (except possibly i and j) are equal 
and therefore the path is of length n or less. Figure 8.1.7 illustrates the matrix path for 


` the graph of Figure 8.1.4a. The matrix path is often called the transitive closure of the 
matrix-adj. ` 


Figure 8.1.7 path = adjor 
Figure 8.1.6 adj or adj; or-adj, or adjs. 
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We may write a C routine that accepts an adjacency matrix adj and computes its 
transitive closure path. This routine uses an auxiliary routine prod(a,b,c), which sets 
the array c equal to the Boolean product of a and b. 


void transclose (int adj[][MAXNODES], int path{) [MAXNODES]) 
{ 
int i, j, k; l 
int newprod[MAXNODES] [MAXNODES] , 
adjprod[MAXNODES] [MAXNODES] ; 


for (i = 0; i < MAXNODES; ++i) 
for (j = 0; j < MAXNODES; ++j) 
adjprod[i] [j] = path[i][j] = adj[i][j]; 
for (i = 1; i < MAXNODES; ++i) { 
/* i represents the number of times adj has */ 


/* heen multiplied by itself to obtain */ 
/* aujpred. At this point path represents */ 
/* ail paths of length i or less */ 


prod (adjprod, adj, newprod); 
for (j = 0; j < MAXNODES; ++j) 
for (k = 0; k < MAXNODES; ++k) 
path[j][k] = path[j][k] || sewprod{j][k]; 
for (j = 0; j < MAXNODES; ++) 
for (k = 0; k < MAXNODES; ++k) 
adjprod(j][k] = newprod[j][k]; 
} /* end for */ 
} /* end transclose */ 


The routine prod may be written as follows: 
void prod (int a[][MAXNODES], int b[][MAXNODES], int c[] [MAXNODES]) 
{ 
‘int i, j, k, val; 


for (i = 0; i < MAXNODES; ++i) /* pass through rows ay 

for (j = 0; j < MAXNODES; ++j) { /* pass through columns */ 
val = FALSE; 
for (k = 0; k < MAXNODES; ++k) 

val = val || (a[i] [k] & b{k](j]); 

c[i][j] = val; 

} /* end for j */ 

} /* end prod */ 


To analyze the efficiency (or inefficiency) of this routine. note that finding the 
boolean product by the method we have presented is Our), where n is the number 
of graph nodes (that is, MAXNODES). In transclose, this process (the call to prod) is 
embedded in a loop that is repeated n — 1 times, so that the entire transitive closure 
routine is Oin’). 
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Warshall’s Algorithm 


The foregoing method is quite inefficient. Let us see if a more efficient method 
to compute path can be produced. Let us define the matrix path, such that path, [il] 
is true if and only if there is a path from node i to node j that does not pass through 
any nodes numbered higher than k (except, possibly, for i and j themselves). How can 
the value of path,., [iJ[j] be obtained from path,? Clearly for any i and j such that 
path, [i] = TRUE, pathy., (iY) must be TRUE (why?). The only situation in which 
pathy.) (ij) can be TRUE while path, |i}[j] equals FALSE is if there is a path from 
i to j passing through node k + 1, but there is no path from i to j passing through only 
nodes | through k. But this means that there must be a path from į to k + | passing 
through only nodes 1 through k and a similar path from k + 1 toj. Thus path;., (OU) 
equals TRUE if and only if one of the following two conditions holds: 


1. pathy [i){j} == TRUE 
2. pathy [i}[k + 1) == TRUE and pathy [k + 1)[j} == TRUE 


This means that pathys, [i[ j] equals path, fv) | (path, {fk + 1) && 
path, [k + 1JU/)). An algorithm to obtain the matrix path, from the matrix path,-\ 
based on this observation follows: 


for (i = 0; i < MAXNODES; ++i) 
for (j = 0: j < MAXNODES; ++j) 
pathy [iJ[j] = path yy CALA] |I (path pz DIK & path k-1 IKL): 


This may be logically simplified and made more efficient as follows: 


for (i = 0; i < MAXNODES; ++i) 
for (j = 0; j < MAXNODES; ++j) 
pathy LJL] = path x7 DLJ); 
for (i = 0; i < MAXNODES; ++i) 
if (path y.4 [i] [k] == TRUE) 
for (j = 0; j < MAXNODES; ++j) 
pathy [iJ[j] = path x2 (ACA) il path yy (AL: 


Clearly, patho [i]|j] = adj, since the only way to go from node i to node j without 
passing through any other nodes is to go directly from / to j. Further, pathsyaxnopes-! 
[AL] = parh{i}{ j }, since if a path may pass through any nodes numbered from 0 
to MAXNODES ~—1, any path from node i to node j may be selected. The following C 
routine may therefore be used to compute the transitive closure: 


void transclose (int adj[}[MAXNODES], int path(] [MAXNODES]) 
{ 
int: 7, J. Ke 
for (i = 0; i < MAXNODES; ++i) 
for (j = 0; j < MAXNODES; ++j) 
path[i]{j) = adj{iJ(j]; /* path starts off as adj */ 
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for (k = 0; k < MAXNODES: ++k) 
for (i = 0; i .<.MAXNODES; +i) 
if (path [i][k] == TRUE) 
for (j = 0; j < MAXNODES; +j) 
: path [i][j] = path [i][j] || path [k] [j]; 
}/* end transclose */ 


This technique increases the efficiency of finding the transitive closure to O(n*). The 
~ Method is often called Warshall’s algorithm, after its discoverer, 


l Shortest-Path Algorithm 


In a weighted graph, or network, it is frequently desired to find the shortest path 
between two nodes, s and £. The shortest path is defined as a path from s to t such that 
the sum of the weights of the arcs on the path is minimized. To represent the network, 
we assume a weight function, such that weight (i, j) is the weight of the arc from i to fs 
If there is no arc from i to J, weight(i,j) is set to an arbitrarily large value to indicate the 
infinite cost (that is, the impossibility) of going directly from i toj. l 


Tf all weights are positive, the following algorithm, attributable to Dijkstra, de- 


tially, distance[s] = 0 and distance[i] = infinity for all i != s. A set perm contains all’ 
nodes whose minimal distance from s is known—that is, those nodes whose distance 
value is permanent and will not change. If a node i is a member of perm, distance{(i] is 
the minimal distance from s to i. Initially, the only member of perm is s. Once t becomes 
a member of perm, distance[t] is known to be the shortest distance from s to 1, and the 


The algorithm maintains a variable, current, that is the node that has been added 
to perm most recently. Initially, current = 5. Whenever a node current is added to 
perm, distance must be recomputed for all successors of current. For every successor i 
of current, if distance[current] + weight(current, i) is less than distance[i), the distance 
from s to i through current is smaller than any other distance from s to i found thus far. 
Thus distance[i] must be reset to this smaller value. ` 


distance[nd] is greater than distance[k}.) Thus k can be added to perm. current is then 
reset to k and the process is repeated. J : 

The following is a C routine to implement this algorithm. In addition to calcu- 
lating distances, the program finds the shortest path itself by maintaining an array 
precede such that precedeļi] is the node that precedes node i on the shortest path found 
thus far. An array perm is used to keep track of the corresponding set. Perm[i] is 1 if iisa 
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member of the set and 0 if not. The routine accepts a weight matrix (with nonadjacent 
arcs having a weight of infinity) and two nodes, s and t, and calculates the minimum 
distance pd from s to t as well as the array precede to define the path. The routine 
assumes the following definitions and declarations: ag 


f#define INFINITY ... 
#define MAXNODES ... 
#define MEMBER 1 

#define NONMEMBER 0 


void shortpath (int weight) [MAXNODES] , int s, int t, int *pd, int precede[]) 
{ ae 


int tistance[MAXNODES], perm[MAXNODES]; 
int current, i, k, dc; 
-int smalldist, newdist; 


/* initialization */ | 
for (i = 0; i < MAXNODES; ++i) { 
perm[i] = NONMEMBER; 
distance[i] = INFINITY; 
} /* end for */ 
perm[s] = MEMBER; 
distance[s} = 0; 
current = S; 
while (current != t) { 
smalldist = INFINITY; - 
dc = distance[current]; 
for (i =.0; į < MAXNODES; i++) 
if (perm{i] == NONMEMBER) { 
newdist = dc + weight[current] [i]; 
if (newdist < distance[i]) { gene 
/* distance from s to i through current is */ 
/* smaller than - distance[i] =*/ 
distance[i] = newdist; Magara 
precede[i] = current; 
} /* end if */ ; 
/* determine the smallest distance */ 
if (distance[i] < smalldist) { 
smalldist = distance[i]; 
keji; 
} /*. end if */ 
} /* end for... if */ 
current = k; 
perm[current] = MEMBER; 
}/* end while */ 
*pd = distance[t]; 
} /* end shortpath */ 
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An alternative implementation that maintains the set of “permanent” nodes as a 
linked list instead of the array perm is left as an exercise for the reader. 

Assuming that a function al/(x) has been defined to return TRUE if every element 
of array x is ] and FALSE otherwise, Dijkstra's algorithm can be modified to find the 
shortest path from a node s to every other node in the graph by modifying the while 
header to 


while (all(perm) == FALSE) 


To analyze the efficiency of this implementation of Dijkstra’s algorithm, note that 
one node is added to perm in each iteration of the while loop so that, potentially, the 
loop must be repeated n times (where n = MAXNODES., the number of nodes in the 
graph). Each iteration involves examining every node [for (i = 0: i < MAXNODES; 
++i), so the entire algorithm is O(n?). We examine a more efficient implementation 
of Dijkstra's algorithm in Section 8.3. 


EXERCISES 


8.1.1. For the graph of Figure 8.1.1b: 
(a) Find its adjacency matrix. 
(b) Find its path matrix using powers of the adjacency matrix. 
(c) Find its path matrix using Warshall’s algorithm. 

8.1.2. Draw a digraph to correspond to each of the following relations on the integers from 1 

to 12. 

(a) xis related to y if x — v is evenly divisible by 3. 

(b) xis related to vif x+ 10*y < xy. 

(c) xis related to y if the remainder on division of x by y is 2. 
Compute the adjacency and path matrices for each of these relations. 

8.1.3. A node nl is reachable from a node n2 in a graph if nl equals n2 or there is a path 
from n2 to nl. Write a C function reach(adj.i,j) that accepts an adjacency matrix and 
two integers and determines if the jth node in the digraph is reachable from the ith node. 

8.1.4. Write C routines which, given an adjacency matrix and two nodes of a graph, compute: 
(a) The number of paths of a given length existing between them 
(b) The number of total paths existing between them 

8.1.5. A relation ona set S (and its corresponding digraph) is symmetric if for any two elements 
x and y in S such that x is related to v, y is also related to x. 

(a) What must be true of a digraph if it represents a symmetric relation? 

(b) Give an example of a symmetric relation and draw its digraph. 

(c) What must be true of the adjacency matrix of a symmetric digraph? 

(d) Write a C routine that accepts an adjacency matrix and ¿© | ‘nines if the digraph 
it represents is symmetric. 

8.1.6. A relation où a set S (and its corresponding digraph and adjacency matrix) is transitive 
if for any three elements x, y. and z in S, if x is related to v and v is related to z, x is 
related to z. 
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(a) What must be true of a digraph if it represents a transitive relation? 
(b) Give an example of a transitive relation and draw its digraph. 


(c) What must-be true of the Boolean product of the adjacency matrix of a transitive 
digraph with itself? 


(d) Write a C routine that accepts an adjacency matrix and determines if the digraph 
` it represents is transitive. 
(e) Prove that the transitive closure of any digraph is transitive. 
(f) Prove that the smallest transitive digraph that includes all nodes and arcs of a given 
digraph is the transitive closure of that digraph. 
8.1.7. Given a digraph, prove that it is possible to renumber its nodes so that the resultant 
s adjacency matrix is lower triangular (see Exercise 1.2.8) if and only if the digraph is 
acyclic. Write a C function lowtri(adj, ltadj, perm) that accepts an adjacency matrix adj 
of an acyclic graph and creates a lower triangular adjacency matrix /tadj that represents 
the same graph. perm is a one-dimensional array such that perm[i) is set to the new 
number assigned to the node that was numbered i in the matrix adj. 


8.1.8. Rewrite the routine shortpath to impleme.. ‘he set of “permanent” nodes as a linked 
list. Show that the efficiency of the method remains O(n?). 


8.2 FLOW PROBLEM 


In this section we consider a real-world problem and illustrate a solution that uses a 

` weighted graph. There are a number of formulations of this problem whose solutions 
carry over to a wide range of applications. We present one such formulation here and 
refer the reader to the literature for alternate versions. 

Assume a water pipe system as in Figure 8.2.1a. Each arc represents a pipe and 
the number above each arc represents the capacity of that pipe in gallons per minute. 
The nodes represent points at which pipes are joined and water is transferred from one 
pipe to another. Two nodes, S and T, are designated as a source of water and:a user of 
water (or a sink), respectively. This means that water originating at S must be carried 
through the pipe system to T. Water may flow through a pipe in only one direction 
(pressure sensitive valves may be used to prevent water from flowing backward), and 
there are no pipes entering S or leaving T. Thus, a weighted directed graph, as in Figure 
8.2.1a, is an ideal data structure to model the situation. 

We would like to maximize the amount of water flowing from the source to the 
sink. Although the source may be able to produce water at a prodigious rate and the sink 
may be able to consume water at a comparable rate, the pipe system may not have the 
capacity to carry it all from the source to the sink. Thus the limiting factor of the entire 
system is the pipe capacity. Many other real-world problems are similar in nature. The 
system could be an electrical network, a railway system, a communications network, 
or any other distribution system in which one wants to maximize the amount of an item 
being delivered from one point to another. 

Define a capacity function, cla,b), where u and b are nodes, as follows: If ad- 
jacent(a,b) is true (that is, if there is a pipe from a to b), c(a,b) is the capacity of the 
pipe from a to b. If there is no pipe from a to b, c(a,b) = 0. At any point in the operation 
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of the system, a given amount of water (possibly 0) flows through each pipe. Define a 
flow function, fia,b), where a and b are nodes, as 0 if b is not adjacent to a, and as the 
amount of water flowing through the pipe from a to b otherwise. Clearly, fia,b) >= 0 
for all nodes a and b. Furthermore, fla,b) <= c(a,b) for all nodes a and b, since a pipe 
may not carry more water than its capacity. Let v be the amount of water that flows 
through the system from S to T. Then the amount of water leaving S through all pipes 
equals the amount of water entering T through all pipes and both these amounts equal 
v. This can be stated by the equality: 


È fSxHy=v= SS fT) 


x€ nodes x€ nodes 


No node other than S can produce water and no node other than T can absorb water. 
Thus the amount of water leaving any node other than S or 7 is equal to the amount of 
water entering that node. This can be stated by 


pa S(x,y) = > S(y, x)! àll nodes x!= §,T 
vE nodes ; yE nodes ` 


Define the inflow of a node x as the total flow entering x and the outflow as the total 
fiow leaving x. The foregoing conditions may be rewritten as 


outflow (S) = inflow (T) = v 
inflow (x) = outflow (x) for all x!= S, T 


Several flow functions may exist for a given graph and capacity function. Figures 
8.2.1b and c illustrate two possible flow functions for the graph of Figure 8.2.1a. Make 
sure that you understand why both of them are valid flow functions and why both satisfy 
the foregoing equations and inequalities. 

We wish to find a flow function that maximizes the value of v, the amount of water 
going from S to T. Such a flow function is called optimal. Clearly, the flow function of 
Figure 8.2.1b is better than the one of Figure 8.2.1c, since v equals 7 in the former but 
only 5 in the latter. See if you can find a flow function which is better than the one of 
Figure 8.2.1b. 

One valid flow function can be achieved by setting fla,b) to O for all nodes a 
and b. Of course this flow function is least optimal, since no water flows from S to T. 
Given a flow function, it can be improved so that the flow from S to T is increased. 
However, the improved version must satisfy all the conditions for a valid flow function. 
In particular, if the flow entering any node (except for S or T) is increased or decreased, 
the flow leaving that node must be increased or decreased correspondingly. The strategy 
for producing an optimal flow function is to begin with the zero flow function and to 
improve upon it successively until an optimal flow function is produced. 


Improving a Flow Function 


Given a flow function f there are two ways to improve upon it. One way con- 
sists of finding a path S=x),x2,...,x» = T from S to T such that the flow along each arc 
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in the path is strictly less than the capacity (that is, f (xx-1, Xx) < C(x,-1, xx) for all 
k between | and n — 1). The flow can be increased on each arc in such a path by the 
minimum value of c(x,-}, Xk) — f (xg-1, Xx) for all k between 1 and n —-1 (so that when 
the flow has been increased along the entire path there is at least one arc <Xk- 14> 
in the path for which f(x,-),x,) = ¢(x,-1,X«) and through which the flow may not be 
increased). - : , 

This may be illustrated by the graph of Figure 8.2.2a which gives the capacity 
and the current flow respectively for each arc. There are two paths from S to T with 
Positive flow ((S,A,C,T) and (S,B,D,T)). However each of these paths contains one arc 
(<A,C> and <B,D>) in which the flow equals the capacity. Thus the flow along these 
paths may not be improved. However, the path (S,A,D,7) is such that the capacity of 
each arc in the path is greater than its current flow. The maximum amount by which 
the flow can be increased along this path is 1, since the flow along arc <D,7> cannot 
exceed 3. The resulting flow function is shown in Figure 8.2.2b. The total flow from S 
to T has been increased from 5 to 6. To see that the result is still a valid flow function 
‘note that for each node (except T) whose inflow is increaséd, the outflow is increased 
by the same amount. 

Are there any other paths whose flow can be improved? In this example, you 
should satisfy yourself that there are not. However, given the graph of Figure 8.2.2a 
we could have chosen to improve the path (S,B,A,D,T). The resulting flow function is 
illustrated in Figure 8.2.2c. This function also provides for a net flow of 6 from S to T 
and is therefore neither better nor worse than the flow function of Figure 8.2.2b. 

Even if there is no path whose flow may be improved, there may be another 
method of improving the net flow from the source to the sink. This is illustrated by 
Figure 8.2.3. In Figure 8.2.3a there is no path from S to T whose flow may be im- 
proved. But if the flow from X to Y is reduced, the flow from X to T can be increased. To 
compensate for the.decrease in the inflow of Y the flow from S to Y could be increased, 
thereby increasing the net flow from S to T. The flow from X to Y can be redirected to 

“T as shown in Figure 8.2.3b and the net flow from S to T can thereby be increased from 
407. 
We may generalize this second method as follows. Suppose that there is a path- 
` from S to some node Y, a path from some node X to T and a path from X to Y with 
positive flow. Then the flow along the path from X to Y may be reduced and the flows 
from X to T and from S to Y may be increased by the same amount. This amount is the 
minimum of the flow from X to Y and the differences between capacity and flow in the 
paths from S to Y and X to T. 

These two methods may be combined by proceeding through the graph from S 
to T as follows: The amount of water emanating from S toward F can be increased by 
any amount (since we have assumed no limit on the amount that can be produced by 
the source) only if the pipes from S to T can carry the increase. S.-npose that the pipe 
capacity from S to x allows the amount of water entering x to be increased by an amount 
a. If the pipe capacity to carry the increase from x to T exists then the increase can be 
made. Then if a node y is adjacent to x (that is. there is an arc <x,y>), the amount of 
water emanating from y toward T can be increased by the minimum of a and the unused 
capacity of arc <x,y>. This is an application of the first method. Similarly, if node x is 
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adjacent to some node y (that is, there is an arc <y,x>), the amount of water emanating 
from y toward T can be increased by the minimum of a and the existing flow from y to x. 
This can be done by reducing the flow from y to x as in the second method. Proceeding 
in this fashion from S to T, the amount by which the flow to T may be increased can be 
' determined. i 

Define a semipath from S to T as a sequence of nodes $ = x),x2,...4n = T 
such that, for all 0 <i <= n — 1, either <x;-1,Xi> or <xj,x;-;> is an arc. Using ~ 
the foregoing technique, we may describe an algorithm to discover a semipath from S 
to T such that the flow to each node in the semipath may be increased. This is done 
by building upon already discovered partial semipaths from S. If the last node in a 
discovered partial semipath from S is a, the algorithm considers extending it to any 
node b such that either <a,b> or <b,a> is an arc. The partial səmipath is extended to 
b only if the extension can be made in such a way that the inflow ic ` can be increased. 
Once a partial semipath has been extended to a node b, that node is ic:noved from 
consideration as an extension of some other ‘partial semipath. (This is because. at this 
point we are trying to discover a single semipath from S to T.) The algorithm of course 
keeps track of the amount by which the inflow to b may be increased and whether its 
increase is due to consideration of the arc <a,b> or <b,a>. 


` 
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This process continues until some partial semipath from S has been completed by 
extending it to T. The algorithm then proceeds backward along the semipath adjusting 
all flows until S is reached. (This will be illustrated shortly with an example.) The entire 
process is then repeated in an attempt to discover yet another such semipath from S to 7. 
When no partial semipath may be successfully extended, the flow cannot be increased 
and the existing flow is optimal. (You are asked to prove this as an exercise.) 


Example 


Let us illustrate this process with an example. Consider the arcs and capacities of 
the weighted graph of Figure 8.2.4. We begin by assuming a flow of 0 and attempt to 
discover an optimal flow. Figure 8.2.4a illustrates the initial situation. The two numbers 
above each arc represent the capacity and current flow respectively. We may extend a 
semipath from S to (S,X) and (S,Z), respectively. The flow from S to X may be increased 
by 4, and the flow from S to Z may be increased by 6. The semipath (S,X) may be 
extended to (S,X,W) and (S,X,Y) with corresponding increases of flow to W and Y of 3 
and 4, respectively. The semipath (S,X, Y ) may be extended to (S,X, ¥,7) with an increase 
of flow to T of 4. (Note that at this point we could have chosen to extend (S,X,W) 
to (S,X,W,T). Similarly we could have extended (S,Z) to (S,Z,Y) rather than (S,X) to 
(S,X,W) and (S,X,Y). These decisions are arbitrary.) 

Since we have reached T by the semipath (5S,X,¥.7) with a net increase of 4, we 
increase the flow along each forward arc of the semipath by this amount. The results 
are depicted in Figure 8.2.4b. 

We now repeat the foregoing process with the flow of Figure 8.2.4b. (S) may 
be extended to (S,Z) only, since the flow in arc <S,X> is already at capacity. The net 
increase to Z through this semipath is 6. (S,Z) may be extended to (S,Z, Y), yielding a net 
increase of 4 to Y. (S,Z, Y) cannot be extended to (S,Z,¥,7), since the flow in arc <Y,T> 
is at capacity. However, it can be extended to (S,Z, YX) with a net increase to node X of 
4. Note that since this semipath includes a backward arc <¥,X >, it implies a reduction 
in the flow from X to Y of 4. The semipath (S,Z,¥,X) may be extended to (S,Z,¥,X, 
W) with a net increase of 3 (the unused capacity of <X,W>) to W. This semipath may 
then be extended to (S,Z, ¥.X,W,T) with a net increase of 3 in the flow to T. Since we 
have reached T with an increase of 3, we proceed backward along this semipath. Since 
<W,T> and <X,W> are forward arcs, their flow may each be increased by 3. Since 
` <Y,X > is a backward arc, the flow along <X,Y> is reduced by 3. Since <Z,Y> and 

<S,Z> are forward arcs, their flow may be increased by 3. This results in the flow 
shown in Figure 8.2.4c. S ; 

We then attempt to repeat the process. (S) may be extended to (S,Z) with an in- 
crease of 3 to Z, (S,Z) may be extended to (S,Z,Y) with an increase of 1 to Y, and 
(S,Z,Y) may be extended to (S,Z,¥,X) with an increase of 1 to X. However, since arcs - 
<S,X>, <Y.T> and <X,W> are at capacity, no semipath may be extended further, 
and an optimum flow has been found. Note that this optimum flow need not be unique. 
Figure 8.2.4d illustrates another optimum flow for the same graph which was obtained 
from Figure 8.2.4a by considering the semipaths (S,X,W.T) and (S,Z, ¥,T). 
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Algorithm and Program 


Given a weighted graph (an adjacency matrix and a capacity matrix) with a source 
S and a sink T, the algorithm to produce an optimum flow function for that graph may 
be outlined as follows: 


1 initialize the flow function to 0 at each arc; 

2 . canimprove = TRUE; 

3 do { 

4 attempt to find a semipath from S to T that k 

increases the flow to T by x> 0; 

5 if (a semipath cannot be found) ° 
-Canimprove = FALSE; 

G -* else 


increase the flow to each node (except 5) 
` in the semipath by x; 


7 } mhile (canimprove == TRUE); 


Of course, the heart of the algorithm lies in line 4. Once a node has been placed 
on a partial semipath, it can no longer be used to extend a different semipath. Thus the 
algorithm uses an array of flags onpath such that onpath[node) indicates whether or not 
node is on some semipath. It also needs an indication of which nodes are at the ends 
of partial semipaths so that such partial semipaths can be extended by adding adjacent 
nodes. endpath[node] indicates whether or not node is at the end of a partial semipath. 
For each node on a semipath the algorithm must keep track of what node precedes it 
on that semipath and the direction of the arc. precede[node] points to the. node’ that 
precedes node on its semipath, and forward{node] has the value TRUE if and only if 
the arc is from precede[node} to node. improve[node] indicates the amount by which 
the flow to node may be increased along its semipath. The algorithm that attempts to 
find a semipath from S to T along which the flow may be increased may be written as 


follows. (We assume that c[a][b] is the capacity of the pipe from a to b and that f[a][b] 
is the current flow from a to b.) 


set endpath[node], onpath[node} to FALSE for all nodes; 
endpath[S] = TRUE; 
onpath[S] = TRUE; . 
/* compute maximum flow from S that pipes can carry */ 
improve[S] = sum of c[S][node] over all nodes node; 
while ((onpath[T] == FALSE) i 
&& (there exists a node nd such that endpath[nd] ==, TRUE)) { 
endpath[nd] = FALSE; : 
while (there exists a- node i such that 
(onpath[i] == FALSE) && (adjacent(nd,i) == TRUE) 
&& (f{nd] [i] < c{nd}[i})) { s 
/* the flow from nd to ‘i may be increased */ 
7% place i on the semipath af 
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onpath[i] = TRUE; 
endpath[i] = TRUE; 
precede[i] = nd; 
forward[i] = TRUE; 
x = c(nd][i] - flnd)(i];  * ; 
improve[i] = (improve[nd] < x) ? improve[nd] : x; 
} /* end while there exists... */ © 
while (there exists a node i such that (onpath[1] == FALSE) 
&& (adjacent(i,nd) == TRUE) && (f[i][nd] > 0)) { 
/* the flow from i to nd may be decreased */ 
f* place i on the semipath yf 
onpath{i] = TRUE; 
endpath[i]:= TRUE; 
precede[i] = nd; 
forward[i] = FALSE; 
improve{i] = (improve[nd] < f{i][nd]) ? improve[nd] : 
fli) [nd]; 
} /* end while there exists... */ 
ak /* end while (onpath[T) == FALSE) */ 
if (onpath(T) == TRUE) 
we have found a semipath from $ to T; 
else oe 
the flow is already optimum; 


Once a semipath from S to T has been found, the flow may be increased along 
that semipath (line 6 above) by the following algorithm: 


x = improve(T]; 

nd = T; 

while (nd != S) { 
pred = precede[ND}; 
(forward[nd] == TRUE) ? (flpred, nd) += x) : (find; pred) -= x); 
nd = pred; 

} /* end while */ 


This method of solving the flow problem is known as the Ford-Fulkerson algorithm 
after its discoverers. 

Let us now convert these algorithms into a C routine maxflow(cap, s, t, flow, 
totflow), where cap is an input parameter representing a capacity function defined on a 
weighted graph. s and r are input parameters representing the source .. ? sink, flow is an 
output parameter representing the maximum flow function, and totflow is the amount 
of flow from s to t under the flow function flow. 

The previous algorithms may be converted easily into C programs. Five arrays 
endpath, forward, onpath, improve, and precede are requires. The question of whether 
jis adjacent to i can be answered by checking whether or not cap[iJ[j] == 0. 


Uy 
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We present the routine here as a straightforward implementation of the algorithms. 
any is a function that accepts an array of logical values and returns TRUE if any element 
of the array is TRUE. If none of the elements of the array is TRUE, any returns FALSE. 
We leave its coding as an exercise. 


#define MAXNODES 50 
fdefine INFINITY ... 


int any(int []); 


void maxflow (int cap[][MAXNODES], int s, int t, 
int flow[] [MAXNODES], int *ptotflow) 
{ 


int pred, nd, i, x; 
int precede[MAXNODES], improve [MAXNODES] ; 
int endpath[MAXNODES]  forward[MAXNODES], onpath[MAXNODES} ; 


for (nd = 0; nd < MAXNODES; ++nd) 
for (i = 0; i < MAXNODES; ++i) 
WEN = 0; 
*ptotflow = 0; 
do { 
/* attempt to find.a semipath from s tot */ 
for (nd = 0; nd < MAXNODES; ++nd) { 
endpath[nd] = FALSE; 
onpath[nd] = FALSE; 
} /* end for */ 
endpath[s] = TRUE; 
onpath[s] = TRUE; 
improve(s] = INFINITY; 
/* we assume that s can provide infinite flow *j 
while ((onpath[t] == FALSE) & (any(endpath) == TRUE)) { 
/* attempt to extend an existing path */ 
for (nd = 0; endpath[nd] == FALSE; nd++) 


endpath[nd] = FALSE; 
for (i = 0; i < MAXNODES; ++i) { 
if ((Flow[nd} [i] < cap[nd][i]) && (onpath{i] == FALSE)) { 
onpath[i] = TRUE; 
endpath{i] = TRUE; 
precede[i] = nd; 
forward[nd] = TRUE; 
x = cap[nd] (i) - flow[ndj [i]; 
improve[i] = (improve[nd] < x) ? improve(nd] : x; 
} /* end if */ 
if ((flow{i}(nd] > 0) & (onpath[i] == FALSE)) { 
onpath{i] = TRUE; 
endpath[i] = TRUE; 
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precede[i] = nd; 
forward[nd] = FALSE; 
improve[i] = (improve[nd] < flow[i][nd]) ? 
improve[nd] : flow[i][nd]; 
} /* end if */ 


\} /* end for */ 
} /* end while */ 


if (onpath[t] == TRUE) { 
/* flow on semipath to t can be increased */ 
x = improve[t]; 
*ptotflow += x; 
nd = t; 
while (nd != s) { 
/* travel back along path */ 
pred = precede[nd]; 
/* increase or decrease flow from pred */ 
(forward[pred] == TRUE) ? (flow[pred](nd] += x): 
(flow[nd] [pred] -= x); 
nd = pred; i 
} /* end while */ 
} /* end if */ 
} while (onpath(t] == TRUE); /* end do */ \ 
} /* end maxflow */ 


Note.that although we have maintained the arrays as they were specified in the 
algorithm, we could have eliminated the array forward by setting precede{nd] to a pos- 
itive number in the case of a forward arc and to a negative number in the case of a 
backward arc. You are asked to pursue this possibility as an exercise. 

For large graphs with many nodes, the arrays improve and endpath may be pro- 
hibitively expensive in terms of space. Furthermore, a search through all nodes to find 
anode nd such that endpath{nd] = TRUE may be very inefficient in terms of time. An 
alternate solution might be to note that the value of improve is required only for those 
nodes nd such that endpath[nd] = TRUE. Those graph nodes which are at the end of 
semipaths may be kept in a list whose nodes are declared by 


Struct listnode { 
‘int graphnode; 
int improve; 
int next; 


Whena node which is at the end of a semipath is required, remove the first element from 
the list. We can similarly dispense with the array precede by maintaining a separate list 
of nodes for each semipath. However, this suggestion is of dubious value since almost 
all nodes will be on some semipath. You are invited to write the routine maxflow,as an 
exercise using these suggestions to save time and space. 
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8.2.8. 


Find the maximum flows for the graphs in Figure 8.2.1 using the Ford-Fulkerson method 

(the capacities are shown next to the arcs). 

Given a graph and a capacity function as in this section, define a cut as any set of nodes 

x containing S but not T. Define the capacity of the cut x as the sum of the capacities of . 

all the arcs leaving the set x. 

(a) Show that for any flow function f, the value of the-total flow v is less than or equal 
to the capacity of any cut. 

(b) Show that equality in (a) is achieved when the flow is maximum and the cut has 
minimum capacity. 

Prove that the Ford-Fulkerson algorithm produces an optimum flow function. 

Rewrite the routine maxflow using a linked list to contain nodes at the eng of semipaths, 

as suggested in the text. 

Assume that in addition to a capacity funcition for every arc, there is also a cost function. 

cost. cost(a,b) is the cost of each unit of flow from node a to node b. Modify the program 

of the text to produce the flow function which maximizes the total flow from source to 

sink at the lowest cost (that is, if there are two flow functions, both of which produce the 

same maximum flow, choose the one with the least cost). 

Assuming a cost function as in the previous exercise, write a program to produce the 


maximum cheapest flow—that is, a flow function such that the total flow divided by the 
cost of the flow is greatest. 


_ A probabilistic directed graph is one in which a probability function associates a proba- 


bility with each arc. The sum of the probabilities of all arcs emanating from any node is 1. 
Consider an acyclic probabilistic digraph representing a tunnel system. A man is placed 
at one node in the tunnel. At each node he chooses to take a particular arc to another 
node with probability given by the probability function. Write a program to compute the 
probability that the man passes through each node of the graph. What if the graph were 
cyclic? 

Write a C program that reads the following information about an electrical network: 


1. n, the number of wires inthe network 

2. The amount of current entering through the first wire and leaving through the nth 

3. The resistance of each of the wires 2 through n — 1 

4. A set of ordered pairs <i,j> indicating that wire i is connected to wire j and that 
electricity flows through wire j to wire j 


‘ The program should compute the amount of current flowing through cach of wires 2 


~ though n — | by applying Kirchhoff’s law and Ohm's law. Kirchhoff’s law states that the 


amount of current flowing into a junction equals the amount leaving a junction. Ohm's: 
law states that if two paths exist between two junctions, the sums of the currents times | 
the resistances over all wires in the two paths are equal. 


8.3 LINKED REPRESENTATION OF GRAPHS 


The adjacency matrix representation of a graph is frequently inadequate because it 
requires advance knowiedge of the number of nodes. If a graph must be constructed in 
the course of solving a problem, or if it must be updated dynamically as the program 
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proceeds, a new matrix must be created for each addition or deletion of a node. This is 
prohibitively inefficient, especially in a real-world situation where a graph may have a 
hundred or more nodes. Further, even if a graph has very few arcs so that the adjacency 
matrix (and the weight matrix for a weighted giaph) is sparse, space must be reserved 
for every possible arc between two nodes, whether or not such an arc exists. If the graph 
contains n nodes, a total of n? locations must be used. 

As vou might expect, the remedy is to use a linked structure, allocating and freeing 
nodes from an available pool. This is similar to the methods used to represent dynamic 
binary and general trees. In the linked representation of trees, each allocated node cor- 
responds to a tree node. This is possible because each tree node is the son of only one 
other tree node and is therefore contained in only a single list of sons. However, in a 
graph an arc may exist between any two graph nodes. It is possible to keep an adja- 
cency list for every node in a graph (such a list contains all nodes adjacent to a given 
node) and a node might find itself on many different adjacency lists (one for each node 
to which it is adjacent). But this requires that each allocated node contain a variable 
number of pointers, depending on the number of nodes to which it is adjacent. This 
solution is clearly impractical as we saw in attempting to represent general trees with 
nodes.containing pointers to each of its sons. 

An alternative is to construct a multilinked structure in the following way. The 
nodes of the graph (hereafter referred to as graph nodes) are represented by a linked 
list of header nodes. Each such header node contains three fields: info, nextnode, and 
arcptr. If p points to a header node representing a graph node a, info(p) contains any 
information associated with graph node a. nextnode(p) is a pointer to the header node 
tepresenting the next graph node, if any. Each header node is at the head of a list of nodes 
of a second type called list nodes. This list is called the adjacency list. Each node on 
an adjacency list represents an arc of the graph. arcptr(p) points to the adjacency list of 
nodes representing the arcs emanating from the graph node a. 

Each adjacency list node contains two fields: ndptr and nextarc. If q points to a list 
node representing an arc <A,B>, ndptr(q) is a pointer to the header node representing 
the graph node B. Nextarc(q) points to a list node representing the next arc emanating 
from graph node A, if any. Each list node is contained in a single adjacency list repre- 
senting all arcs emanating from a given graph node. The term allocated node is used 
to refer to either a header or a list node of a multilinked structure representing a graph. 
We also refer to an adjacency list node as an arc node. 

Figure 8.3.1 illustrates this representation. If each graph node carries some in- 
formation but (since the graph is not weighted) the arcs do not, two types of allocated 
nodes are needed: one for header nodes (graph nodes) and the other for adjacency list 
nodes (arcs). These are illustrated in Figure 8.3.1a. Each header node contains an info 
field and two pointers. The first of these is to the adjacency list of: <s emanating from 
the graph node, and the second is to the next header node in the grapi. Each arc node 
contains two pointers, one to the next arc node in the adjacency list and the other to the 
header node representing the graph node that terminates the arc. Figure 8.3.1b depicts 
a graph and Figure 8.3.1c its linked representation. 

Note that header nodes and list nodes have different formats and must be repre- 
sented by different structures. This necessitates either keeping two distinct available 
lists or defining a union. Even in the case of a weighted graph in which each list node 
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A sample header node representing a graph node. A sample list node representing an arc. 
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(b) A graph. 


<A, B> 


(c) Linked representation of a graph. 


Figure 8.3.1 Linked representation of a graph. 


contains an info field to hold the weight of an arc, two different structures may be neces- 
sary if the information in the header nodes is not an integer. However, for simplicity we 
make the assumption that both header and list nodes have the same format and contain 


two pointers and a single integer information field. These nodes are declared using the 
array implementation as 


#define -MAXNODES 500 


Struct nodetype { l 
int info; 
int point; 
int next; 
Ii 
struct nodetype. node [MAXNODES] ; 


In the case of a header node, node[p] represents a graph node A, node[p].info rep- 
resents the information associated with the graph node A, node[p].next points to the 
next graph node, and node(p).point points to the first list node representing an arc 
emanating from A.-In the case of a list node, node[p] represents an arc <A,B>, 
node[|p}.info represents the weight of the arc, node[p].next points to the next arc 
emanating from A, and node[p].point points to the header node representing the 
graph node B. 


Alternatively, we may use the dynamic implementation, declaring the nodes as 
follows: 


struct nodetype { 

int info; 

struct nodetype *point; 
struct nodetype *next; 
‘oe 
struct nodetype *nodeptr; 


We use the array implementation in the remainder of this section and assume the exis- 
tence of routines getnode.and freenode. 

We now present the implementation of the primitive graph operations using the 
linked representation. The operation jeinwt(p,g,wt) accepts two pointers p and q to two 
header nodes and creates an arc between them with weight wr, If an arc already exists 
between them, that arc’s weight: is set to wt. 


void joinwt (int p, int q, int wt) 


int m; r2; 

/* search the list of arcs emanating from node[p] */ 
/* for an arc to node[q] : */ 
r2 = -1; l 


r = node[p].point; 
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} 


while (r >= 0 & node[r].point != q) { 
rer; 
r = node[r].next; 

} /* end while */ 

if (r >= 0). { 
/* node(r) represents an arc from */ 


i node(p] to node[q] *} 
node[r].info = wt; 
return; 
} /* end if */ 
/* an arc frommode[p} to node{q] does not */ 
y% exist. Such an arc must be created. i 


r = getnode(); 

node[r].point = q; 

node[r].next = -1; 

node{r].info = wt; 

(r2 < 0) ? (node(p].point = r) : (node{r2].next = r); 
/* end joinwt */ 


We leave the implementation of the operation join for an unweighted graph as an exer- 


cise for the reader. The operation remv(p.q) accepts pointers to two header nodes and 
removes the arc between them, if one exists. 


void remv (int p, int a) 


{ 


} 


int r, 2; 


rn = -1; 
r = node[p].point; 
while (r >= 0 & node[r].point != q) { 
rer; 
r = node(r].next; 
} /* end while */ 
if (r >= 0) { i 
/* r points to an arc from node[p] */ 
a to node(q] */ 
(r2 < 0) ? (node[p}].point = node[r].next): 
(node[r2].next = node[r].next); 
freenode(r) ; 
return; 
} /* end if */ 
/* if no arc has been found, then no action */ 
/* need be taken */ 
/* end rem: */ 


We leave the implementation of the operation remvwt(p.g.x), Which sets x to the wei ght 


of the arc <p,g> in a weighted graph and then removes the arc from the graph. as an 
exercise for the reader. 
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The function adjacent(p,q) accepts pointers to two header nodes and determines 
whether node(q) is adjacent to node(p). 


int adjacent (int p, int q) 
{ 
intr; 


r = node[p].point; 
while (r >= 0). 
if (node[r].point == q) 
return (TRUE); 
else 
r = node[r].next; 
return (FALSE); 
} /* end adjacent */ 


Another useful function is findnode(graph, x) which returns a pointer to a header 
node with information field x if such a header node exists, and returns the null pointer 
otherwise. 


int findnode (int graph, int x) 
{ 
‘int p; 


p = graph; 
while (p >= 0)° 
- if (node[{p].info == x) 


p = node[p].next; 
return (-1); 
. } /* end findnode */ 


The function addnode(&graph, x)'adds a node with information field x to a graph 
and returns a pointer to that node. 


‘int addnode (int *pgraph, int x} 
{ 
int p; 


p = getnode(); 
node[p].info = x; 
node[p].point = -1; 
node[p].next = *pgraph; 
*pgraph = p; 

return (p); ` 

/* end addnode */ 
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The reader should be aware of another important difference between the adja- 
cency matrix representation and the linked representation of graphs. Implicit in the 
matrix representation is the ability to traverse a row or column of the matrix. Travers- 
ing a row is equivalent to identifying all arcs emanating from a given node. This can 
be done efficiently in the linked representation by traversing the list of arc nodes start- 
ing at a given header node. Traversing a column of an adjacency matrix, however, is 
equivalent to identifying all arcs that terminate at a given node; there is no correspond- 
ing method for accomplishing this under the linked representation. Of course, the linked 
representation could be modified to include two lists emanating from each header node: 
one for the arcs emanating from the graph node and the other for the arcs terminating 
at the graph node. However, this would require allocating two nodes for each arc, thus 
increasing the complexity of adding or deleting an arc. 

Alternatively, each arc node could be placed on two lists. In this case, an arc node 
would contain four pointers: one to the next arc emanating from the same node, one to 
the next arc terminating at the same node, one to the Feader node at which it terminates 
and one to the header node from which it emanates. A nuader node would contain three 
pointers: one to the next header node, one to the list of arcs emanating from it and one to 
the list of arcs terminating at it. The programmer must, of course, choose from among 
these representations by examining the needs of the specific problem and considering 
both time and storage efficiency. 

We invite the reader to write a routine remvnode(graph, p) that removes a header 
node pointed to by p from a graph pointed to by graph using the various graph repre- 
sentations outlined in the foregoing. Of course, when a node is removed from a graph 
all arcs emanating and terminating at that node must also be removed, In the linked 
representation which we have presented there is no easy way of removing a node from 
a graph since the arcs terminating at the node cannot be obtained directly. 


Dijkstra’s Algorithm Revisited 


In Section 8.1 we presented an implementation of Di ijkstra’s algorithm for finding 
the shortest path between two nodes in a weighted graph represented by a weight matrix. 
That implementation was O(n”), where n is the number of nodes in the graph. We now 
show how the algorithm can be implemented more efficiently in most cases if the graph 
is implemented using adjacency lists. 

We suggest review of the algorithm described in Section 8.1. This algorithm may 
be outlined as follows. We seek a shortest path from s to t. «pd is to be set to the shortest 
distance; precede[i] to the node preceding node i in the shortest path: 


for (all nodes i) { 
distance[i] = INFINITY; 
perm[i] = NONMEMBER; 


perm[s] = MEMBER; 
distance[s] = 0; 
‘current = sS; 


NOW Se wee, 
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8 while (current != t) { 


9 dc = distance[(current] ; f 
10 for (all nodes i that are successors of current) { 

11 newdist = dc + weight[current] [i]; 

12 if (newdist < distance[i]) { 

13 distance[i] = newdist; 

14 precede[i] = current; 

15 } /* end if */ 


16 } /* endfor */ 
17 k = the node k such that perm[k] ==, NONMEMBER and 


such that distance[k] is smallest; 
18 current = k; 
19 perm[k] = MEMBER; 
20 } /* end while */ 
21 ‘*pd = distance[t]; 


Review how this algorithm is implemented in Section 8.1. Note especially how 
finding the minimum distance (line 17) is incorporated into the for loop and how that 
loop is implemented. 

The keys to an efficient implementation are lines 10 and 17. In Section 8.1, where 
we had access only to a weight matrix, there is no way to limit the access to the succes- 
sors of current as specified in line 10. Jt is necessary to traverse all n nodes of the graph 
each time the inner loop is repeated. We are able to increase efficiency by looking only 
at elements not in perm, but that cannot speed things up by more than a constant factor. 
Once an O(n) inner loop is required, we may.as well use it to compute the minimum as 
well (line 17). 

However, given an adjacency list representation of the graph, it is possible to tra- 
verse directly all nodes adjacent to current without examining all graph nodes. There- 
fore. the total number of nodes i examined in the loop headed by line 10 is O(e), where 
e is the number of edges in the graph. [Note that we are not saying that each exe- 
cution of the inner loop is O(e) but that the total of all repetitions of all passes of 
the inner loop is O(e).] In most graphs, e is far smaller than n*, so this is quite an 
improvement. 

However, we are not yet done. Since we are eliminating a traversal througn all 
nodes, we must find an alternative way of implementing line 17 to find the node with 
the smallest distance. If the best we can do in finding this minimum distance is O(n), 
the entire process remains O(n”). 

Fortunately, there is a solution. Suppose that, instead of maintaining the array 
perm, we maintained its complement, notperm. Then line 3 would become 


3 notperm[i] = MEMBER; 
line 5 would become 
5 notperm[s] = NONMEMBER; 


line 17 would become 
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17 k = the node k such that notperm[k] == MEMBER and 
such that distance[k] is smallest; 


and line 19 would become 


`~ 


19 notperm[k] = NONMEMBER; 


The operations performed on the array notperm are creation [line 5; this may be 
O(n) but it is outside the while loop and therefore does not hurt the overall efficiency], 
finding the minimum element (line 17), and deleting the minimum element (line 19). 
But these latter two operations can be combined into the single pgmindelete operation 
of an ascending priority queue and, by now, we have a number of ways of implementing 
that operation in less than O(n). In fact, we can implement pqmindelete in O(log n) by 
using an ascending heap, a balanced binary tree, or a 2-3 tree. If the set notperm is 
implemented as a priority queue using one of these techniques, the efficiency of n such 
operations is O(n log n). If a priority queue ordered by the value of distance is used to 
implement notperm, the position of i must be adjusted in the priority queue whenever 
distance{i] is modified in line 13. Fortunately, this can also be done in O(log n) steps. 

Thus Dijkstra’s algorithm can be implemented using O((e + n)log n) operations, 
which is significantly better than O(n") for sparse graphs (that is, graphs with very 
few edges as opposed to dense graphs that have an edge between almost every pair of 
nodes). We leave an actual C implementation as an exercise for the reader. 


Organizing the Set of Graph Nodes 


In many applications, the set of graph nodes (as implemented by header nodes) 
need not be organized as a simple linked list. The linked list organization is suitable 
only when the entire set of graph nodes must be traversed and when graph nodes are 
being dynamically inserted. Both of these operations are highly efficient on a linked 
list. 

If graph nodes must also be deleted, the list must be doubly linked. In addition, as 
noted earlier, there is the need to ensure that no arcs emanate or terminate at a deleted 
node or that all such arcs are deleted as part of the node deletion routine. If we choose 
merely to ensure that no arcs terminate in a node being deleted rather than to delete 
any such arcs, it is not necessary to keep with each node a list of arcs terminating at 
the node. It is only necessary to maintain a count field in the node to hold the number 
of arcs terminating at the node; when count becomes 0 (and no arcs terminate at the 
node), the node may be deleted. 

If graph nodes are not being added or deleted, the nodes can be kept in a simple 
array, where each array element contains any necessary information about the node 
plus a pointer to an adjacency list of arcs. Each arc need contain only an array index to 
indicate the position of its terminating node in the array. 

In many applications, graph nodes must be accessed by their contents. For exam- 
ple. in a graph whose nodes represent cities, an application must find the appropriate 
node given the name of the city. If a linked list is used to represent the graph nodes, the 
entire list must be traversed to find the node associated with a particular name. 
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The problem of finding a particular element in a set based on its contents, or value, 
is one that we have already studied in great detail: it is simply the searching problem. 
And we know a great many possible solutions; binary search trees, multiway search 
trees, and hash tables are all ways of organizing sets to permit rapid searching. 

The set of graph nodes can be organized in any of these ways. The particular 
organization chosen depends on the detailed needs of the application. In Dijkstra's al- 
gorithm, for example, we have just seen an illustration where the set of graph nodes 
could be organized as an array that implements an ascending heap used as a priority 
queue. Let us now look at a different application. We introduce it with a frivolous ex- 
ample, but the application itself is quite important. 


Application to Scheduling 


Suppose a chef in a diner receives an order for a fried egg. The job of frying an 
egg can be decomposed into a number of distinct subtasks: 


Get egg Crack egg Get grease 
Grease pan Heat grease Pour egg into pan 
Wait until egg is done Remove egg 


Some of these tasks must precede others (for example, “get egg” must precede 
“crack egg”). Others may be done simultaneously (for example, “get egg” and “heat 
grease”). The chef wishes to provide the quickest service possible and is assumed to 
have an unlimited number of assistants. The problem is to assign tasks to the assistants 
SO as to complete the job in the least possible time. 

Although this example may seem frivolous, it is typical of many real-world 
scheduling problems. A computer system may wish to schedule jobs to minimize 
turnaround time; a compiler may wish to schedule machine language operations to 
minimize execution time; or a plant manager may wish to organize an assembly line to 
minimize production time. All these problems are closely related and can be solved by 
the use of graphs. 

Let us represent the above problem as a graph. Each node of the graph represents 
a subtask and each arc <x,y> represents the requirement that subtask y cannot be per- 
formed until subtask x has been completed. This graph G is shown in Figure 8.3.2. 


Figure 8.3.2 Graph G. 
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Figure 8.3.3 Graph T. 


Consider the transitive closuf® of G. The transitive closure is the graph T such 
that <x,y> is an arc of T if and only if there is a path from x to y in G. This transitive 
closure is shown in Figure 8.3.3. 

In the graph T, an arc exists from node x to node y if and only if subtask x must 
be performed before subtask y. Note that neither G nor T can contain a cycle, since 
if a cycle from node x to itself existed, subtask x could not be performed until after 
subtask x had been completed. This is clearly an impossible situation in the context of 
the problem. Thus G is a dag, a directed acyclic graph. 

Since G does not contain a cycle, there must be at least one node in G which has no 
predecessors. To see this suppose that every node in the graph did have a predecessor. 
In particular, let us choose a node z that has a predecessor y. y cannot equal z or the 
graph would have a cycle from z to itself. Since every node has a predecessor, y must 
also have a predecessor x that is not equal to either y or z. Continuing in this fashion, a 
sequence of distinct nodes 


my % W, V, Uy... 


is obtained. If any two nodes in this sequence were equal, a cycle would exist from 
that node to itself. However, the graph contains only a finite number of nodes so that 
eventually, two of the nodes must be equal. This is a contradiction. Thus there must be 
at least one node without a predecessor. 

In the graphs of Figures 8.3.2 and 8.3.3, the nodes A and F do not have pre- 
decessors. Since they have no predecessors the subtasks that they represent maybe 
performed immediately and simultaneously witi,out waiting for any other subtasks to be 
completed. Every other subtask must wait until at least one of these is completed. Once 
these two subtasks have been performed, their nodes can be removed from the graph. 
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Note that the resulting graph does not contain any cycles, since nodes and arcs have 
been.removed from a graph that originally contained no cycles, Therefore the resulting 
graph must also contain at least one node with no predecessors. In the example, B and 
H are two such nodes. Thus the subtasks B and H may be performed simultaneously in 
the second time period. 

Continuing in this fashion we find that the minimum time in which the egg can 
be fried is six time periods (assuming that every subtask takes exactly one time period) 
and that a maximum of two assistants need be employed, as follows: 


Time period Assistant I Assistant 2 


Get egg Get grease 
Crack egg Grease pan 
Heat grease 

Pour egg into pan 

Wait until done 

Remove egg 


AuUuswn— 


The above process can be outlined as follows: 


1. Read the precedences und construct the graph. 
2. Use the graph to determine subtasks that can be done simultaneously. 


Let us refine each of these two steps. Two crucial decisions must be made in 
refining step 1. The first is to decide the format of the input, the second is to decide 
on the representation of the graph. Clearly. the input must contain indications of which 
subtasks must precede others. The most convenient way to represent these requirements 
is by ordered pairs of subtasks; each input line contains the names of two subtasks where 
the first subtask on a line must precede the second. Of cqurse, the data must be valid in 
the sense that no subtask may precede itself (no cycles are permitted in the graph). Only 
those precedences that are implied by the data and the transitive closure of the resulting 
graph are assumed to hold. A subtask may be represented by a character string such as 
“get egg” or by anumber. We choose to represent subtasks by character strings in order 
that the input data reflect the real-world situation as closely as possible. 

What information should be kept with each node of the graph? Clearly, the name 
of the subtask that the node represents is needed to locate the node associated with a 
particular task and for output purposes. This name will be kept as an array of single 
characters. The remaining information depends on how the graph is used. This will 
become apparent only after step 2 is refined. Here is a good example of how the various 
parts of a program outline interact with each other to produce a single unit. 

Step 2 can be refined into the following algorithm: 


while (the graph is not empty) { 
determine which nodes have no predecessors; 
output this group of nodes with an indication that they 
can be performed simultaneously in the next time period; 
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remove these nodes and their incident arcs from the graph; 
} /* end while */ 


How can it be determined which nodes have no predecessors? One method is to 
maintain a count field in each node containing the number of nodes that precede it. Note 
that we are not interested in which nodes precede a given node—only in how many. 

Initially, after the graph has been constructed, we examine all the graph nodes and 
place those with zero count on an output list. Then, during each simulated time period, 
the output list is traversed, each graph node on the list is output, and the adjacency list 
of arcs emanating from that graph node is traversed. For each arc, the count in the graph 
node that terminates the arc is reduced by 1, and if the count thereby becomes 0, that 
terminating graph node is placed on the output list of the next time e period. Atthe same 
time, the arc node is freed. 

The refinement of step 2 may then be rewritten as follows: 


/* traverse the set of graph nodes and place all those */ 
/* nodes with 0 count on the initial output list */ 
1 outp = NULL; 
2 for (all node{p) in the graph) 
3 if (count(p) == 6) { 
4 remove node(p) from the graph; 
5 place node(p) on the output list; 
6 } /* endif */ 
/* simulate the time periods */ 
7 period = 0; 
8 while (outp != NULL) { 
9° ++period; 
10 printf ("%d\n", period); 
/* initialize the next period's output list */ 
11 nextout = NULL; 
/* traverse the output list */ 


12 p= outp; 
13 while (p != NULL) { 
14 printf("%s", info(p)); 


for (all arcs a emanating from node(p)) { 
/* reduce count in terminating */- 


/* node gi 
16 t = the pointer to the node that terminates a; 
17 count(t)--; 
16 if (count(t) == 0) { 
15 remove node(t) from the graph; 
20 f add node(t) to the nextout list; 
21 } /* end if */ 
22 free arc (a) 
23 } /* end for */ 
24 q = next(p); 
25 free node(p); 
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26 Paq; 

27 } /* end while ptr 
28 outp = nextout; 

29 } /* end while “y 

30 if (any nodes remain in the graph) 

31 error - there is a cycle in the graph; 


C Program 


Let us first indicate the structure of the nodes that are required. The header nodes 
that represent graph nodes contain the following fields: 


info ~ the name of the subtask represented by this node 
count the number of predecessors of this graph node 
arcptr a pointer to the list of arcs emanating from this node 
nextnode a pointer to the next node in the Output list 


Each list node representing an arc contains two pointers: 


nodeptr `a pointer to its terminating node 
nextarc a pointer to the next arc in the adjacency list 


Thus two types of nodes are required: one to represent graph nodes and one to | 
represent arcs. These may be declared by 


#define MAXGRAPH 
#define MAXARC 
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struct graphtype { 
char info[20); 
int count; 
int arcpointer; 
int nextnode; 
}; 
struct arctype { 
int nodeptr; 
int nextarc; 
k 
struct graphtype graphnode[MAXGRAPH] ; 
struct arctype arc[MAXARC]; 


The array graphnode is a hash table, with rehashing used to resolve collisions. The array 
arc is a list of available arc nodes allocated by a routine getarc and freed by freearc. 
These manipulate an available pointer availarc. 

We also assume the existence of a function find(inf) that searches graphnode for 
the presence of an element nd (that is, a graph node) such that graphnode[nd).info 
‘equals inf. If no such graph node exists, find allocates a previously empty position nd, 
sets graphnode[nd].info to inf, graphnode|nd).count to 0 and graphnode|nd].arcptr to 
—1, and increases the count of the number of nodes in the graph (which is maintained in 
a variable numnodes) by-1. In either case, find returns nd. Of course, nd is determined 
within find via functions hash and rehash applied to inf. 

A routine join is also used. This routine accepts pointers to two graph nodes, 
nl and n2, and allocates an arc node (using getarc) that is established as an arc from, 
graphnode[n1] to graphnode|n2). join is responsible for adding the arc node to the list 
of arcs emanating from graphnode(n1] as well as for increasing graphnode|n2}.count 
by 1. Finally, the routines strcpy and strcmp are used w copy strings and compare them 
for equality. 

We may now write a C scheduling program: 


#include <string.h> 
#define MAXGRAPH 
#define MAXARC... 
#define NULLTASK "" 
#define TRUE 1 
#define FALSE 0 
struct graphtype { 

char info[20); 

int Count; 

int arcptr; 

int pextnode; 
N 


struct arctype { 
int nodeptr; 
int nextarc; 


Ji 


Sec. 8.3 Linked Representation of Graphs 555 


556 


Struct graphtype graphnode[MAXGRAPH] ; 
Struct arctype arc[MAXARC]: 

int availarc; 

int numnodes = 0; /* number of graph nodes */ 
int find(char *); 

int getarc(int, int); 

int hash(int); 

int rehash(int); 

void join(int, int); 

void freearc(int); 

main() 


int p,q. r,s, t, outp, nextout, period; 
char inf1[20), inf2[20]; 
/* initialize graph nodes and available list of arcs */ 
for (p = 0: p< MAXGRAPH; ++p) 
strcpy(graphnode[p]. info, NULLTASK); 
for (s = 0; s < MAXARC -1 + ++5) 
arc[s].nextarc = s + 1; 
arc[MAXARC - 1].nextarc = -1; 
availarc = 0; 
while(scanf("%s %s", infl, inf2) != EOF) { 
p = find(infl); 
q = find(inf2);" 
join(p,q); 
} /* end while */ 


/* The graph has been constructed. Traverse the hash table and 
/* place all graph nodes with zero count on the output list. */ 


outp = -1; 
for (p = 0; p < MAXGRAPH; ++p) 
if ((strcmp(graphnode[p). info, NULLTASK) == FALSE) && 


(graphnode[p].count == 0)) { 


graphnode[p].nextnode = outp; 
Outp = p; 
} /* end if */ 
/* simulate the time periods */ 
period = 0; 
while (outp != -1) { 
++period; 
printf("%d\n", period); 
/* initialize output list for next period */ 
nextout = -]; 
/* traverse the output list */ 
“p = outp; 
while (p != -1) { 
printf("%s\n", graphnode[p]. info); 
r = graphnode[p]}.arcptr; 
/* traverse the list of arcs */ 
while (r != -1) { 
s = arc[r].nextarc; 
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t = arc(r].nodeptr; 
--graphnode[t] .count; 
if (graphnode[t].count == 0) { 
/* place graphnode[{t] on the next */ | 
/* period's output list */ 
graphnode[t].nextnode = nextout; 
nextout = t; 
} /* end if */ 
freearc(r); 
r= S; 
} /* end while */ 
/* delete the graph node */ 
strcpy(graphnode(p] . info, NULLTASK) ; 
--numnodes ; 
/* continue traversing the output list */ 
p = graphnode(p]}.nextnode; 
} /* end while (p != -1) */ 
/* reset output list for the next period */ 
outp = nextout; 
} It, end while (outp != -1) */ 
if (numnodes != 0) 
error("error in input - graph contains a cycle\n"); 


} /* end schedule */ 


EXERCISES 


8.3.1. 


8.3.2. 
8.3.3. 


8.3.4. 


8.3.5. 


Implement a graph using linked lists so that each header node heads two lists: one con- 
taining the arcs emanating from the giuph node and the other containing the arcs termi- 
nating at the graph node. 

Implement a graph so that the lists of header nodes and arc nodes are circular. 
Implement a graph using an adjacency matrix represented by the sparse matrix tech- 
niques of Section 8.1. 

Implement a graph using an array of adjacency lists. Under this representation, a graph 
of n nodes consists of n header nodes, each containing an integer from Oton- I and 
a pointer. The pointer is to a list of list nodes each of which contains the node number 
of a node adjacent to the node represented by the header node. Implement Dijkstra’s 
algorithm using this graph representation with the array formed into an ascending heap. 
There may be more than one way to organize a set of subtasks in a minimum number of 
time periods. For example, the subtasks in Figure 8.3.2 may be completed in six tir 2 
periods in one of three different methods: 


EE 
Period Method I Method 2 Method 3 
ane Le 

1 AF F: A,F 
a B.H AH F. 
3 l Bl Ł 
4 G Ç ( 
5 D. D L 
6 E E E 
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8.3.6. 


8.3.8. 


Figure 8.3.4 


Write a program to generate all possible methods of organizing the subtasks in the min- 
imum number of time periods. 


Consider the graph of Figure 8.3.4. The program schedule outputs the following orga- 
nization of tasks: 


Time Subtasks 


AUN= 
> 


This requires three assistants (for time period 1). Can you find a method of organizing 
the subtasks so that only twò assistants are required at any time period, yet the entire 
job can be accomplished in the same four time periods? Write a program that organizes 
Subtasks so that a minimum number of assistants are needed to complete the entire job 
in the minimum number of time periods. : 
If there is only one worker available, it will take k time periods to complete the entire 
job, where k is the number of subtasks. Write a program to list a valid order in which 
the worker can perform the tasks. Note that this program is simpler than schedule, since 
an output list is not needed; as soon as the count field reaches 0 the task may be output. 
The process of converting a set of precedences into a single linear list in which no later 
element precedes an earlier one is called a topological sort. 
A PERT network is a weighted acyclic directed graph in which each arc represents 
an activity and its weight represents the time needed to perform that activity. If arcs 
<a,b> and <b,c> exist in the network, the activity represented by arc <a,b> must 
be completed before the activity represented by <b,c> can be started. Each node x of 
the network represents a time at which all activities represented by arcs terminating at 
x can be completed. 
(a) Write aC routine that accepts a representation of such a network and assigns to each 
node x the earliest time that all activities terminating in that node can be completed. 
Call this quantity er(x). (Hint: Assign time 0 to all nodes with no predecessors. 
If all predecessors of a node x have been assigned times, e7(.x) is the maximum over 
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pas (c) Figure 8.3.5 Some PERT networks. 
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all predecessors of the sum of the time assigned toa predecessor and the weight of 
the arc from that predecessor to x.] ~ 


(b) Given the assignment of times in part (a), write a C routine that assigns to each 


(c) 


(e) 
(Ca) 


Exercises 


— 


node x the latest time that all activities terminating in x can be completed without 
delaying the completion of all the activities, Call this quantity I(x). (Hint: Assign 
{ime er(x) to all nodes x with no successors. If all suceessors of a node x have been 
signed times, /t(x) is the minimum over all Successors of the difference between 

the -ime assigned to a successor and the weight of the arc from x to the successor.) 
“Prove that there is at least one path in the graph from a node with no predecessors 
to a node with no successors such that et(x) = It(x) for every node x on the path. 
Such a path is called a critical path. ` 

Explain.the significance of a critical path by showing that reducing the time of the 
activities along every critical path reduces the earliest time by which the entire job 
can be completed. -É ; "a 

Write a C routine to find all critical paths in a PERT network. 

Find the critical paths in the networks of Figure 8.3.5. 


8.3.9. Write a C program that accepts a repesentation of a PERT network as given in Exercise 
8.3.8 and computes the earliest time in which the entire job can be finished, if as many 
activities as possible may be performed in parallel. The program should also print the 
starting and ending time of each activity in the network. Write another C program to 
schedule the activities so that the entire job can be completed at the earliest possible 
time subject to the constraint that at most m activities can be performed in parallel. 


8.4 GRAPH TRAVERSAL AND SPANNING FORESTS 


A great many algorithms depend on being able to traverse a graph. In this section we 
examine techniques for systematically accessing all the nodes of a graph and present 
several useful algorithms that implement and use those traversal techniques. We also 
look at ways of creating a general forest that is a subgraph of given graph G and contains 
all the nodes of G. 


Traversal Methods for Graphs 


It is often desirable to traverse a data structure, that is, to visit each of its elements 
in a systematic manner. We have already seen traversal techniques for lists and trees; 
we now examine traversal techniques for graphs. 

The elements of the graph to be visited are usually the graph nodes. It is always 
possible to traverse a graph efficiently by visiting the graph nedes in an implementation- 
dependent manner. For example, if a graph with n nodes is represented by an adjacency 
matrix or an array of adjacency lists, simply listing the integers from 0 te n — 1 “tra- 
verses” the graph. Similarly, if the graph nodes are maintained as a linked list, a search 
tree, a hash table or some other structure, traversing the underlying structure might be 
considered a “traversal” of the graph. However, of greater interest is a traversal that 
corresponds to the graph structure of the object, not one for the underlying implemen- 
tation structure. That is, the sequence in which the nodes are visited should relate to the 
adjacency structure of the graph. 

Defining a traversal that relates to the structure of a graph is more complex than 
for a list or a tree for three reasons: 


1. In general, there is no natural “first” node in a graph trom which the traversal 
should start, as there is a first node in a list or a root in a tree. Further, once a 
starting node has been determined and all nodes reachable from that node have 
been visited, there may remain other nodes in the graph that have not been visited 
because they are not reachable from the starting node. This is again unlike a list 
or tree where every node is reachable from the header or the root. Thus, once all 
reachable nodes in a graph have been visited, the traversal algorithm again faces 
the problem of selecting another starting node. 


2. There is no natural order among the successors of a particular node. Thus there is 
no a priori order in which the successors of a particular node should be visited. 
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3. Unlike a node of a list or a tree, a node of a graph may have more than one pre- 
decessor. If node x is a successor of both nodes y an z, x may be visited after 
y but before z. It is therefore possible for a node to i visited before one of its 
predecessors. In fact. if a graph is cyclic, every possible traversal must include 
some node that is visited before one of its predecessors. 


To deal with these three complications, any graph traversal method incorporates 
the following three features: 

1. The algorithm is either presented with a starting node for the traversal or 
chooses a random node at which to start. The same traversal algorithm produces a dif- 
ferent ordering of the nodes depending on the node at which it starts. In the following 
discussion, s denotes the starting node. 

We also assume a function selecr with no parameters that chooses an arbitrary 
unvisited node, The select operation is usually dependent on the graph representation. 
If the graph nodes are represented by the integers 0 to n — 1, select maintains a global 
variable /ast (initialized to — 1) that keeps track « ° the last node selected by select and 
utilizes a flag visited(i) that is true only if node(i) has been visited. The following is an 
algorithm for select: 


for (i = last + 1; i < n && visited(i); i++) 


if (i == n) 
return(-1) 

last = i; 

return(i); 


A similar select routine can be implemented if the graph nodes are organized as a linked 
list, with Jast being a pointer to the last header node selected. 

2. Generally. the implementation of the graph determines the order in which the 
successors of a node are visited. For example, if the adjacency matrix implementation 
is used, the node numbering (from 0 to n — 1) deicrmines the order; if the adjacency 
list implementation is used, the order of the arcs on the adjacency list determines the 
order in which the successors are visited. Alternatively, and much less commonly, the 
algorithm may choose a random ordering among the successors of a node. We consider 
two operations: firstsucc(x), which returns a pointer to the “first” successor of node(x), 
and nextsucc(x,y). where node(y) is a successor of node(x), which returns a pointer 
to the “next” successor of node(x) following node(y). Let us examine how to imple- 
ment these fiynettons under both the adjacency matrix and linked representations of 
a graph. j 

In the adjacency matrix representation, if x and y are indices such that node(y) is a 
successor of node(x), the next successor of x following y can be computed as the lowest 
index į greater than y such that adj(x,i) is true. Unfortunately, things are not so simple for 
the linked representation. If x and y represent two graph nodes in a graph representation 
that uses adjacency lists (x and y can be either array indices or pointers to header nodes), 
there is no way to access the “next” successor of node(x) following node(y). This is 
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because, in the adjacency list representation, the ordering of successors is based on the 
ordering of arc nodes. It is therefore necessary to locate the arc node following the arc 
node that points to node(y). But there is-no reference from node(y) to the arc nodes that 
point to it, and therefore no way to get to the next arc node. It is therefore necessary for y . 
to point to an arc node rather than a graph node, although the pointer actually represents 
the graph node terminating that arc [that is, node(ndptr(y))]. The next successor of 
node(x) following that graph node can then be found as node(ndptr(nextarc(y))), that 
is, the node that terminates the arc that follows the arc node node(y) on the adjacency 
list emanating from node(x). 

To employ a uniform calling technique for firstsucc and nextsucc under all graph 
implementations, we present them as subroutines rather than functions: 


firsisucc(x, yptr, ynode) sets both yptr and ynode to the index of the first successor 
of node(x) under the adjacency matrix representation. Under the linked represen- 
tation, ynode is set to a pointer to the header node (or a node number) of the first 
successor of node(x), and yptr is set to point to the arc node representing the arc 
from node(x) to node(ynode). 


nextsucc(x, yptr, ynode) accepts two array indices (x and yptr) in the adjacency 
matrix representation and sets both yprr and ynode to the array index of the suc- 
cessor of node(x) that follows node(yptr). In the linked representation, x is an 
array index or a pointer to a header node, yprr is a pointer to an arc node and is 
reset to point to the arc node that follows node(yptr) on the adjacency list, and 
ynode is set to point to the header node that terminates the arc node pointed to by 
the modified value of yprr. 


Given these conventions, an algorithm to visit all successors of node(x) can be 
written as follows: 


firstsucc(x, yptr, ynode) ; 
while (yptr != NULL) { 
visit(ynode); 
nextsucc(x, yptr, ynode) ; 
} /* end while */ 


This algorithm will operate correctly under both implementations. 
Let us now present algorithms for firstsucc and nextsucc. If the adjacency matrix 
implementation is used, nextsucc(x, yptr, ynode) is implemented as follows: 


for (i = yptr +1; i< n; i++) 
if (adj(x,i)) { 
yptr = ynode = 7; 
return; 
} /* end for ... if */ 
yptr = ynode = null; 
return; 
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Sfirstsuce(x, yptr, Ynode) is implemented by 


nextsucc(x,-1, ynode) ; 
yptr = ynode; 


Note that traversing all of a node's successors in a graph of n nodes is Oln) using the 
adjacency matrix representation. 

If the linked representation is used, nextsucc is implemented quite simply as fol- 
lows. (We assume an arcptr. field in each header node and ndptr and nexiarc fields in 
each arc node.) 


yptr = nextarc(yptr); 
ynode = (yptr == NULL) ? NULL : ndptr(yptr); 


Jirstsucc is implemented by 


yptr = arcptr{x); 
ynode = (yptr == NULL) ? NULL : ndptr(yptr): ” 


Note that if e is the number of edges (arcs) in the graph and 7 the number of graph 
nodes, e/n is the average number of arcs emanating from a given node. Traversing the 
Successors of a particular node by this method is therefore O(¢/n) on the average. If the 
graph is sparse (that is, very few of the n` possible edges exinij, this is a significant 
advantage of the adjacency list representacicn. 

3. If a node has more than one predecessor, it is necessarily encountered more than 
once during a traversal. Therefore. to ensure termination and to ensure that each node 
is visited only once, a traversal elgorithm must check that a node being encountered 
has not been visited previously. There are two ways to do this. One is to maintain a set 
of visited nodes. The set would be maintained for efficient icorup and insertion as a 
search tree or a hash table. Whenever a node 1s encountered, ine table is searched to 
see if the node has already been visited. If it has, the node is ignored: i it has not. the 
node is visited and added to the table. Of course, the lookup and insertion add to the 
traversal overhead. 

The second technique is to keep a flag visited(nd) in each node. Initially, all flags 
are set off (false) via a quick nongraph traversal.through the list of graph nodes. The visit 
routine turns the flag on (rrue) in the node being visited. When a node is encountered. its 
flag is examined. If it is on, the node is ignored: if it is off. the node is visited-and the flag 
is set on. The flagging technique is used more commonly. since the flag initialization 
overhead is less than the table lookup and maintenance overhead. 


Spanning Forests 


A forest may be defined as an acyclic graph in which every node has one or no 
predecessors. A tree may be defined as a forest in which only a single node (called the 
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root) has no predecessors. Any forest consists of a collection of trees. An ordered forest 


is one whose component trees are ordered. Given a graph G, F is a spanning forest of 
Gif . 


1. F is a subgraph of G containing all the nodes of G. 
2.. F is an ordered forest containing trees T), T2, ... » Tn- 


3. T; contains all nodes that are reachable in G from the root of T; and are not con- 
tained in Tj for some j < i. 


F is a spänning tree of G if it is a spanning forest of G and consists of a single tree. 

Figure 8.4.1 illustrates four spanning forests for the graph of Figure 8.1.3. In 
each forest the arcs of the graph that are not included in the forest are shown as 
dotted arrows, and the arcs included in the forest are solid arrows. The spanning 
forests of Figure 8.4.1a and b are spanning trees, whereas those of Figure 8.4.1c and 
d are not. 

Any spanning tree divides the edges (arcs) of a graph into four distinct groups: 
tree edges, forward edges, cross edges, and back edges. Tree edges are arcs of the 
graph zhat are included in the spanning forest. Forward edges are arcs of the graph 
from a node to a spanning forest nonson descendant. A cross edge is an arc from one 
node to another node that is not the first node’s descendant or ancestor in the spanning 
forest. Back edges are arcs from a node to a spanning forest ancestor. The following 
table classifies the arcs of the graph of Figure 8.1.3 in relation to each of the spanning 
trees of Figure 8.4.1: 


Arc (a) (b) (c) (d) 
e ŘŮŐ——m 
<A.C> tree iree cross cross 
<A,D> tree tree tree tree 
<B,E> tree tree tree tree 
<B,F > tree cross tree cross 
<B,H> tree tree tree tree 
ZE GS tree tree cross tree 
<F,G> back back tree back 
<G,B> tree ; tree back tree 
<G.F> forward tree back tree 
<H,G> back back Cross back 


Consider a traversal method that visits all nodes reachable from a previously 
visited node before visiting any node not reachable from a previously visited node. 
In such a traversal a node is visited either arbitrarily or as the successor of a pre- 
viously visited node.’ The traversal defines a spanning forest in which an arbitrarily 
selected node ‘is the root of a tree in the spanning forest. and in which a node nl se- 
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lected as the successor of n2 is a son of n2 in the spanning forest. Fer example. the 
traversal ACGBEFHD defines the forest of Figure 8.4.1a, and the traversal BEFGH- 
CAD defines that of Figure 8.4.1¢. Although a particular traversal defines’a single 
spanning forest, a number of traversals may define the same forest. For example, 
ACDGBEFH also defines the spanning forest of Figure 8.4. la. 


Sec, 8.4 Graph Traversal and Spanning Forests 565 


Undirected Graphs and Their Traversais 


Thus far we have only considered directed graphs. An undirected graph may be 
considered a symmetric directed graph, that is. one in which an arc <B,A> must exist 
whenever an arc <A,B> exists. The undirected arc (A, B) represents the two directed 
arcs <A,B> and <B,A>. 

An undirected graph may therefore be represented as a directed graph using either 
the adjacency matrix or adjacency list method. An adjacency matrix representing an 
undirected graph must be symmetric; the values in row i, column j and in row j, column 
i must be either both false [that is, the arc (i, j) does not exist in the graph} or both rrue 
[the arc (i, j) does exist}. In the adjacency list representation, if (i, J) is an undirected 
arc, the arc list emanating from node(i) contains a list node representing directed arc 
“ij ang the list emanating from node( j) contains a list node representing directed 
arc <j,i>. In an undirected graph, if a node x is reachable from a node y (that is, there 
is a path from y to x), v is reachable from x as well along the reversed path. 

Since an undirected graph is represented by a directed graph, any traversal method 
for directed graphs induces a traversal method for undirected graphs as well. Figure 
8.4.2 illustrates an undirected graph and two spanning trees for that graph. The tree in 
Figure 8.4.2b is created by either of the traversals ABEFKGCDHIJ or ABEFGKCHDIJ, 
among others. The tree in Figure 8.4.2c is created by the traversals ABEFGKDCJHI or 
ABDJECHIFGK, among others. Note that the edges included and excluded from the 
spanning tree are all bidirectional. 

Spanning forests constructed by undirected graph traversals have several special 
properties. First. there is no distinction between forward edges (or tree edges) and back 
edges. Since an edge in an undirected graph is bidirectional, such a distinction is mean- 
ingless. In an undirected graph, any arc between a node and its nonson descendant is 
called a back edge. A 

Second, in an undirected graph, all cross edges are within a single tree. Cross 
edges between trees arise in a directed graph traversal when there is an arc <x, y> such 
that y is visited before x and there is no path from y to x. Therefore. the arc <x,y> is 
a cross edge. In an undirected graph containing an arc (x, y), x and y must be part of 
the same tree. since each is reachable from the other via that arc at least. A cross edge 
in an undirected graph is possible only if three nodes x, v, and z are part of a cycle and 
y and Z are in separate subtrees of a subtree whose root is x. The path between v and z 
must then include a cross edge between the two subtrees. Confirm that this is the case 
with all cross edges of Figure 8.4.2c. 

Because undirected graphs have “double” the edges of directed graphs, their span- 
ning forests tend to have fewer, but larger, trees. 

An undirected graph is termed connected if every node in it is reachable from ev- 
ery other. Pictorially. a connected graph has only one segment. For example, the graph 
of Figure 8.4.2a is a connected graph. The graph of Figure 8.1.la is not connected, 
since node Æ is not reachable from node C, for example. A connected component of 
an undirected graph is a connected subgraph containing all arcs incident to any of its - 
nodes such that no graph node outside the subgraph is reachable from any node in the 
subgraph. For example, the subgraph of Figure 8.1.1 has three connected components: 
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Figure 8.4.2 


nodes A,B,C,D, F; nodes E and G; and node H. A connected graph has a single connected 
component, > 

The spanaing forest of a connected graph is a spanning tree. Each tree in the 
spanning forest of an undirected graph contains all the nodes.in a single connected 
component ọf the graph. Thus any traversal method that creates a spanning forest (that 
is, one that visits all nodes reachable from visited nodes before visiting any other nodes) 
can be used to determine whether an undirected graph is connected and to identify an 
undirected graph’s connected components. 

In traversing an undirected graph, it is not very important which node s is used as 
the starting node or how select chooses an arbitrary node (except perhaps in terms of 
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the efficiency of select.) This is because all nodes of a connected component will wind 
up in the same tree regardless of the choice of s or how select operates. This is not true 
in traversing a directed graph. 

For example, the traversal of Figure 8.4.1a and b used s = A. Since all nodes 
are reachable from A, the spanning forest is a tree and select is never needed to choose 
an arbitrary node once that tree is built. In Figure 8.4.1c, however. B is the starting 
node; therefore only nodes reachable from B are included in the first tree. select then 
chooses C. Since only visited nodes are reachable from C, it is alone in its tree. selecris 
then required again to choose A, whose tree completes the traversal. In Figure 8.4.1d, 
s equals C and select is required only once, when it returns A. Thus, if it is desired to 
create as large and as few trees as possible, s should be a node with as few predecessors 
as possible (preferably none) and select should choose such a node as well. This may 
make select less efficient. 

We now examine two traversal methods and their applications to both directed 
and undirected graphs. 


Depth-First Traversal 


The depth-first traversal technique is best defined using an algorithm dfira- 
verse(s) that visits all nodes reachable from s. This algorithm is presented shortly. 
We assume an algorithm visit(nd) that visits a node nd and a function visited(nd) that 
returns TRUE if nd has already been visited and FALSE otherwise. This is best imple- 
mented by a flag in each node. visit sets the field to TRUE. To execute the traversal, the 
field is first set FALSE for all nodes. The traversal algorithm also assumes the function 
select with no parameters to select an arbitrary unvisited node. select returns null if all 
nodes have been visited. 


for (every node nd) 
visited(nd) = FALSE; 
S = a pointer to the starting node for the traversal; 
while (s != NULL) { 
dftraverse(s); 
s = select(); 
} /* end while */ 


Note that a starting node s is specified for the traversal. This node becomes the root of the ` 
first tree in the spanning forest. The following is a recursive algorithm for dftraverse(s), 
using the routines firstsucc and nextsuce presented earlier: 


/* visit all nodes reachable from s */ 
visit(s); 
/* traverse all unvisited successors of s */ 
firstsucc(s,yptr.nd); 
while (yptr != NULL) { 
if (visited(nd) == FALSE) 
dftraverse(nd); 
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nextsucc(s, yptr,nd); 
} /* end while */ 


If it is known that every node in the graph is reachable from the starting node 
s (as in the case of the graph of Figure 8.1.3 starting from node A or in the case of a 
connected undirected graph such as that of Figure 8.4.2a), the spanning forest is a single 
spanning tree and the while loop and select are not required in the traversal algorithm, 
since every node is visited in a single call to dftraverse. 

A depth-first traversal, as its name indicates, traverses a single path of the graph 
as far as it can go (that is, until it visits a node with no successors or a node all of whose 
successors have already been visited). It then resumes at the last node on the path just 
traversed that has an unvisited successor and begins traversing a new path emanating 
from that node. Spanning trees created by a depth-first traversal tend to be very deep. 
Depth-first traversal is also sometimes called depth-first search. 

Figure 8.4.1a and c are both depth-first spanning trees of the graph of Figure 8.1.3. 
In Figure 8.4. 1a, the traversal started at A and proceeded as follows: ACGBEFHD. Note 
that this is the preorder traversal of the spanning tree. In fact, the depth-first traversal 
of a tree is its preorder traversal. In Figure 8.4.1c, the traversal starts at B and pro- 
ceeds as follows: BEFGH. At that point, all nodes reachable from B have been visited; 
consequently select is called to find an arbitrary unvisited node. Figure 8.4.1c assumes 
that select returned a pointer to C. But no unvisited nodes are successors of C (G has 
already been visited); therefore select is called again and returns A. D is an unvisited 
successor of A and is visited to complete the traversal. Thus Figure 8.4. ke corresponds 
to the complete depth-first traversal BEFGHCAD. * 

This illustrates that there may be several depth-first traversals and depth-first 
spanning trees for a particular directed graph. The traversal depends very much on 
how the graph is represented (adjacency matrix or adjacency list), on how the nodes 
are numbered, on the starting node, and on how the basic depth-first traversal is im- 
. plemented (in particular, the implementation of firstsucc, nexisucc, and select). The 
essential feature of a depth-first traversal is that, after a node is visited. all descendants 
of the node are visited before its unvisited brothers. Figure 8.4.2b represents the depth- 
first traversal ABEFKGCDHLJ of the undirected graph of Figure 8.4.2a. 

As usual, a stack can be used to eliminate the recursion in depth-first traversal. 
The following is a complete nonrecursive depth-first traversal algorithm: 


for (every node nd) 
visited(nd) = FALSE; 
s = a pointer to the starting node for the traversal; 
ndstack = the empty stack; 
while (s != NULL) { 
visit(s); 
/* find first unvisited successor */ 
firstsucc(s,yptr,nd); 
while ((nd != NULL) & (visited(nd) == TRUE)) 
nextsucc(s, yptr, nd) ; 
/* if no unvisited successors, simulate return */ 
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i* from recursive call */ 
while ((nd == NULL) && (empty(ndstack) == FALSE)) { 
popsub(ndstack,s,yptr); 
/* find next unvisited successor */ 
nextsucc(s, yptr,nd); 
while ((nd != NULL) & (visited(nd) == TRUE)) 
nextsucc(s,yptr,nd); 
} /* end while ((nd == NULL) && ...) */ 
if (nd != NULL) { 
/* simulate the recursive call */, 
push(ndstack,s,yptr); 


s = select; 
} /* end while (s != NULL) */ 


Note that each stack element contains pointers to both a father node (s) and an incident 
arc or ifs son (yptr) to allow continuation of the traversal of the successors. 

To use this algorithm to construct a spanning tree, it is necessary to keep track of 
a node's father when it is visited, as follows. First, change the if statement at the end of 
the algorithm to 


if (nd != NULL) { 
/* simulace the recursive call */ 
push(ndstack,s,yptr); 


fas; /* this statement is are */ 
S$ = nd; 
} /* end if */ 
else { 
s = select(); 
f = NULL; /* this statement is added */ 


} ./* endif */ 


Second, initialize f to NULL at the beginning of the algorithm. ‘Third, change 
visit(s) to addson(f.s): visited(s) = TRUE, where addson adds node(s} to the tree as 
the next son of node(f). [Recall that visis) was defined to set visited(s) to TRUE.) If 
fis NULL. addson(f.s) adds node(s) as a new tree in the forest (for example, it calls 
maketree. It is assumed that the tree roots are kept in a linked list managed by maketree 
using two global variables pointing to the first and last trees in the forest.) 

You are invited to apply this modified algorithm to the graph of Figure 8.1.3, with 
s initialized to A and the successors of any node ordered alphabetically. to obtain the 
spanning tree of Figure 8.4.1a. Similarly, applying the modified algorithm to the same 
graph, with s initialized to B and assuming that select chooses C before A and D. and 
A before D, yields the spanning forest of Figure 8.4.1c. Applying the algorithm to the 
graph of Figure 8.4.2a produces the tree of Figure 8.2.4b. 
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As illustrated by Figure 8.4.2b. a depth-first spanning forest of an undirected 
graph may contain tree edges and back edges but cannot contain any cross edges. To 
see why, assume that (x, v) is an edge in the graph and that x is visited betore y-Ina 
depth-first traversal, y must be visited as a descendant of x before any nodes that are not 
reachable from x. Thus the arc (x. x) is either a tree edge or a back edge. The same is 
true in reverse if v is visited first, since the undirected arc (x, y) is equivalent to (y, x). 
In a directed graph, however, the arc <x,y> but not <y,x>.may be in the graph. If y 
is visited first, since x may not be reachable from y, x may not be in a subtree rooted 
at y, so the arc <x, y> may be a cross edge even in a depth-first spanning tree. This is 
illustrated by the arcs <A,C> and <C,G> in Figure 8.4.1c. 


Applications of Depth-First Traversal 


Depth-first traversal, like any other traversal method that creates a Spanning for- 
est. can be used to determine if an undirected graph is connected and to identify the 
connected components of an undirected graph. Whenever select is called. a new con- 
nected component of the graph is being traversed. If select is never called, the graph is 
connected. 

Depth-first traversal can also be used to determine if a graph is acyclic. In both 
directed and undirected graphs, a cycle exists if and only if a back edge exists in a 
depth-first spanning forest. It is obvious that if a back edge exists, the graph contains a - 
cycle formed by the back edge itself and the tree path Starting at the ancestor head of 
the back edge and.ending at the descendant tail of the back edge. To prove that a back’ 
" edge must exist in a cyclic graph, consider the node nd of a cycle that is the first node 
` in its cycle visited by a depth-first traversal. There must exist a node x such that the arc 
` (x, nd) or <x,nd> is in the cycle. Since x is in the cycle, it is reachable from nd, so that 
x must be a descendant of nd in the spanning forest. Thus the arc (x, nd) or <x,nd> is 
a back edge by definition. 

Therefore to determine if a graph is acyclic it is gnly necessary to determine that 
an edge encountered during a depth-first traversal is not a back edge. When considering 
an edge (s, nd) or <s,nd> in the depth-first traversal algorithm, the edge can be a back 
edge only if visited(nd) is true. In an undirected graph, where there are no cross edges 
in a depth-first traversal, (s, nd) is a back edge if and only if visited(nd) is true and 
nd ! = father(s) in the spanning forest. ; 

In a directed graph, <s,nd> can be a back edge even if nd == father(s), since 
<s.nd> and <nd,s> are distinct arcs. Thus in a directed graph a cycle may consist 
of only two nodes (such as s and nd), whereas in an undirected graph, at least three 
are required. However, since a directed graph’s spanning tree may contain cross edges 
as well as back edges, visited(nd) equaling true is not enough to detect a cycle. For 
example, cross edges <A,C>, <H,G> and <C,G> in Figure 8.4.1c are not part of 
z cycle, although C has been visited by the time <A,C> is considered, and G has 
been visited by the time <H,G> and <C.G> are considered. To determine that 2n arc 
<s.nd> is not a back edge when visited(nd) is true, it is necessary to consider each 
ancestor of s in turn to ensure that it does not equal nd. We leave the details of an 
aigorithm to determine if a directed graph is acyclic (that is, a dag) as an exercise for 
the reader. ; 
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In the preceding section we examined an algorithm to schedule tasks given a 
series of required precedences among those tasks. We saw that the precedence relations 
among the tasks can be represented by a dag. The algorithm presented in that section 
can be used to specify a linear ordering of the nodes in which no node comes before a 
preceding node. Such a linear ordering is called a topological sort of the nodes. 

A depth-first traversal can be used to produce a reverse topological ordering of 
the nodes. Consider the inorder traversal of the spanning forest formed by a depth- 
first traversal of a dag. We now prove that such an inorder traversal produces a reverse 
topological ordering. 

To repeat the recursive definition of inorder traversal of a forest from Section 5.5: 


1. Traverse the forest formed by the subtrees of the first tree in the forest, if any. 
2. Visit the root of the first tree. 
3. Traverse the forest formed by the remaining trees of the forest, if any. 


To differentiate in the following discussion between the depth-first traversal that cre- 
ates the forest and the inorder traversal of the forest, we refer to DF-visits (and a DF- 
traversal) and JO-visits (and an IO-traversal), respectively. 

An [O-traversal of the depth- -first spanning tree of a'dag must be in reverse topo- 
logical order. That is, if x precedes y, x is IO-visited after y. To see why this is so, con- 
sider the arc <x,y>. We show that y is 1O-visited before x. Since the graph is acyclic, 
<x,y> cannot be a back arc. If it is a tree’arc or a forward arc, so that y is a descen- 
dant of x in the spanning forest, y is IO-visited before x because an inorder traversal 
10-visits the root of a subtree after traversing all its subtrees. If <x,y> is a cross edge, 
y must have been 10-visited before x (otherwise, y would have been a descendant of 
x). Consider the smallest subtrees, S(x) and S(y), containing x and y respectively whose 
roots are brothers. (Roots of trees of the spanning forest are also considered brothers 
in this context.) Then since y was DF-visited before x, S(y) precedes S(x) in their sub- 
tree ordering. Thus S(y) is 1O-traversed before S(x), which means that y is IO-visited 
before x. 

Thus an algorithm to determine a reverse topological ordering of the nodes of a 
dag consists of a depth-first search of the dag followed by an inorder traversal of the re- 
sulting spanning forest. Fortunately, it is unnecessary to make a separate traversal of the 
spanning tree since an inorder traversal can be incorporated directly into the recursive 
depth-first traversal algorithm. To do this, simply push a node onto a stack when it is 
DF-visited. Whenever dftraverse returns, pop the stack and IO-visit the popped node. 
Since dftraverse DF-traverses all the subtrees of a tree before completing the tree’s 
DF-traversal and traverses the first subtree of a set of brothers before DF-traversing 
the others, this routine yields an 1O-traversal. The reader i is invited to implement this 
algorithm nonrecursively. 


Efficiency of Depth-First Traversal 
The depth-first traversal routine visits every node of a graph and traverses all 


the successors of each node. We have already seen that, for the adjacency matrix im- 
plementation, traversing all successors of a node using firstsucc and nextsucc is O(n). 
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where n is the number of graph nodes. Thus traversing the successors of all the nodes is 
O(n*). For this reason, depth- i he search using the adjacency matrix representation is 
O(n +n) (n node visits and n? possible successor examinations), which is the same 
as O(n‘). 

If the adjacency list representation is used, traversing all successors of all nodes 
is O(e), where eis the number of edges in the graph. Assuming that the graph nodes 
are organized as an array or a linked list, visiting all n nodes is O(n), so that the ef- 
ficiency of depth-first traversal using adjacency lists is O(n + e). Since e is usually 
much smaller than n?, the adjacency list representation yields more efficient traversals. 
(The difference, however, is somewhat offset by the fact that in an adjacency matrix, 
traversal of successors involves merely counting from l to n, whereas in an adjacency 
list, it involves successively accessiag fields in nodes.) Depth-first traversal is often 
considered O(e), since e is usually larger than n. 


Breadth-First Traversal 


An alternative traversal method, breadth-first traversal (or breadth-first search), 
visits all successors of a visited node before visiting any successors of any of those 
successors. This is in contradistinction to depth-first traversal, which visits the suc- 
cessors of a visited node before visiting any of its “brothers.” Whereas depth-first 
traversal tends to create very long, narrow trees, breadth-first traversal tends to create 
very wide, short trees. Figure 8.4.1b represents a breadth-first traversal of the graph 
of Figure 8.1.3, and Figure 8.4.2c represents a breadth- fipst traversal of the graph of 
Figure 8.4.2a. 

In implementing depth-first traversal. each visited node i is placed ona stack (either 
implicitly via recursion or explicitly), reflecting the fact that the last node visited is 
the first node whose successors will be visited. Breadth- first traversal is implemented 
using a queue, representing the fact that the first node visited is the first node whose 
successors are visited. The following is an algorithm bftraverse(s) to traverse a graph 
using breadth-first traversal beginning at node(s): 


ndqueue = the empty queue; 
while (s != NULL) { 
visit(s); 
insert(ndqueue, s); 
while (empty(ndqueuve) == FALSE) {. 

x = remove(ndqueue) ; 

/* visit all successors of x */ 

firstsucc(x, yptr, nd); 

while (nd != NULL) { 

» if (visited(nd) == FALSE) { 
visit(nd); 
insert(ndqueue, nd) ; 

} /t enbafit/ 4, 
nextsucc(x, yptr, nd); 

} /* end while */ 

} /* end while */ 
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s = select(); 
} /* end while */ 


We leave the modification of the algorithm to produce a breadth-first spanning 
forest as an exercise for the reader. Figure 8.4.1b illustrates a breadth-first spanning 
tree for the graph of Figure 8.1.3, representing the breadth-first traversal ACDGBFEH. 
Note that although the traversal differs significantly from the depth-first traversal ACG- 
BEFHD that produced the spanning tree of Fi gure 8.4. la, the two spanning trees them- 
selves do not differ except for the position of node F. This reflects the fact that the graph 
of Figure 8.1.3 has relatively few arcs (ten) compared with the total number of potential 
arcs (n? = 64). In a.graph with more arcs, the difference in spanning forests is more 
pronounced. . 

A breadth-first spanning tree does not have any forward edges, since all nodes 
adjacent to a visited nade nd have already been visited or are spanning tree sons of 
nd. For the same reason, for a directed graph, all cross edges within the same tree 
are to nodes on the same or higher levels of the tree. For an undirected graph, a 
breadth-first spanning forest contains no back edges. since every back edge is also a 
forward edge. 

Breadth-tirst traversal can be used for some of the same applications as depth- 
first traversal. In particular, breadth-first traversal can be used to determine if an 
undirected graph is connected and to identify the graph’s connected components. 
Breadth-first traversal can also be uscd to determine if a graph is cyclic. For a directed 
graph. this is detected when a back edge is found: for an undirected graph, it is detected 
when a cross edge within the same tree is found. 

For an unweighied graph, breadth-first traversal can also be used to find the short- 
est path (fewest arcs) from one node to another. Simply begin the traversal at the first 
node and stop when the target node has been reached. The breadth-first spanning tree 
path from the root to the target is the shortest path between the two nodes. 

The efficiency of breadth-first traversal is the same as that of depth-first traversal: 
each node is visited once and all arcs emanating from every node are considered. Thus 
its efficiency is O(n?) for the adjacency matrix graph representation and O(n + e) for 
the adjacency list graph representation. 


Minimum Spanning Trees 


Given a connected weighted graph G, it is often desired to create a spanning tree 
T for G such that the sum of the weights of the tree edges in T is as small as possible. 
Such a tree is called a minimum Spanning tree and represents the “cheapest” way of 
connecting all the nodes in G. 

There are a number of techniques for creating a minimue spanning tree for a 
weighted graph. The first of these, Prim's algorithm. discovered independently by Prim 
and Dijkstra, is very much like Dijkstra’s algorithm for finding shortest paths. An ar- 
bitrary node is chosen initially as the tree root (note that in an undirected graph and its 
spanning tree, any node can be considered the tree root and the nodes adjacent to it as 
its sons). The nodes of the graph are then appended to the tree one at a time until all 
nodes of the graph are included. 
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The node of-the graph added to th: tree at each point is that node adjacent to a 
node of the tree by an arc of minimum weight. The arc of minimum weight becomes a 
tree arc connecting the flew node to the tree. When all the nodes of the graph have been 
added to the tree, a minimum spanning tree has been constructe | for the graph. 

To see that this technique creates a minimum spanning tree, consider a minimum 
spanning tree T for the graph and consider the partial tree PT built by Prim’s algorithm 
at any point. Suppose that (a.b) is the minimum-cost arc from nodes in PT to nodes not 
in PT, and suppose that (a,b) is not in T. Then, since there is a path between any two 
graph nodes in a spanning tree, there must be an alternate path between a and b in T 
that does not include arc (a.b). This aiternate path P must include an arc (x.v) from a 
node in PT to a node outside of PT. Let us assume that P contains subpaths between a 
and x and between y and b. : 

Now ‘consider what would happen if we replaced arc (x, y) in T with (a, b) to 
create NT. We claim that NT is also a spanning tree. To prove this, we need to show 
two things: that any two nodes of the graph are connected in NT and that NT does not 
contain a cycle—that is, that there is only one path between any two nodes in NT. 

Since T is a spanning tree. any two nodes, m and n, are connected in T. If the 
path between them in T does not contain (x,y), the same path connects them in NT. If 
the path between them in 7 does contain (x,y), consider the path in NT formed by the 
subpath in T from m to x, the subpa i in P (which is in T) from x to a. the arc (a,b), the 
subpath in P from b to y, and the subpath in T from y to n. This is a path from m to n in 
NT. Thus any two nodes of the graph are connected in NT. 

To show that NT does not contain a cycle, suppose that it did. If the cycle does 
not contain (a,b), the same cycle would exist in T. But that is impossible, since T is a 
spanning tree. Thus the cycle must contain (a,b). Now consider the same cycle with arc 
(a,b) replaced with the subpath of P between a and x, the arc (x.y), and the subpath ` 
P between y and b. The resulting path must also be a cycle and is a path entirely in 
But, again. T cannot contain a cycle. Therefore NT also does not contain a cycle. 

NT has thus been shown to be a spanning tree. But NT must have lower cost than T, 
since (a,b) was chosen to have lower cost than (x,y). Thus T'is not a minimum spanning 
tree unless it includes the lowest weight arc from PT to nodes outside PT. Therefore 
any arc added by Prim’s algorithm must be part of a minimum spanning tree. 

The crux of the algorithm is a method for efficient determination of the “clos- 
est” node to a partial spanning tree. Initially, when the partial tree consists of a single 
root node, the distance of any other node nd from the tree, distanceind}, is equal to 
weight(root.nd). When a new node, current, is added to the tree, distance|nd] is mod- 
ified to the minimum of distance[nd] and weight(current.nd). The node added to the 
tree at each point is the node whose distance is lowest. For nodes md in the tree. dis- 
tance[tnd] is set to infinity, so that a node outside the tree is chosen as closest. An 
additional array closest{nd] points to the node in the tree such that distance{[nd| = 
weighticlosest{nd},ud). that is, the node in the tree closest to nd. If two nodes, x and y. 
are not adjacent, veight(x,y) is also infinity. i 

Prim’s algorithm may therefore be implemented as follows: 


root = an arbitrary node chosen as root; 
for (every node nd in the graph) { 
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distance[nd] = weight(root,nd) ; 
closest[nd] = root; 
d } /* end for */ 
distance[root] = INFINITY; 
current = root; 
for (i = 1; i < number of nodes in the graph; ++7) { 
/* find the node closest to the tree */ 
mindist = INFINITY; 
for (every node nd in the graph) 
if (distance[nd] < mindist) { 
current = nd; 
mindist = distance[nd]; 
} /* end if */ 
/* add the closest node to the tree */ 
j“ and adjust distances yi 
addsof(closest[current] ,current) ; 
distance(current] = INFINITY; 
for (every node nd adjacent to current) 
, if.((distance[nd] < INFINITY) 

. && (weight(current,nd) < distance[nd])) { 
distance[nd] = weight(current,nd); 
closest[nd] = current; 

} /* end if */ 
} /* end for */ 


If the graph is represented by an adjacency matrix, each for loop in Prim’s algo- 
rithm must examine O(n) nodes. Since the algorithm contains a nested for loop, it is 
O(n‘). 

However. just as in Dijkstra’s algorithm, Prim’s algorithm can be made more 
efficient by maintaining the graph using adjacency lists and keeping a priority queue of 
the nodes not in the partial tree. The first inner loop [for (every node nd in the graph) ...] 
can then be replaced by removing the minimum-distance node from the priority queue 
and adjusting the priority queue. The second inner loop simply traverses an adjacency 
list and adjusts the position of any nodes whose distance is modified in the priority 
queue. Under this implementation, Prim’s algorithm is O((n: + e) log n). 


Kruskal’s Algorithm 


Another algorithm to create a minimum spanning tree is attributable to Kruskal. 
The nodes of the graph are initially considered as n distinct partial trees with one node- 
each. At each step of the algorithm, two distinct partial trees are connected into a single 
partial tree by an edge of the graph. When only one partial tree exists (after n — 1 such 
steps), it is a minimum spanning tree. 

The issue of Course is what connecting arc to use at each step. The answer is to 
use the arc of minimum cost that connects two distinct trees. To do this, the arcs can be 
placed in a priority queue based on weight. The arc of lowest weight is then examined 
to see if it connects two distinct trees. 
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To determine if an arc (x,y) connects distinct trees, we can implement the trees 
_ with a father field in each node. Then we can traverse all ancestors of x and y to obtain 
the roots of the trees, containing them. If the roots of the two trees are the same node, 
x and y are already in the same tree, arc (x,y) is discarded, and the arc of next lowest 
weight is examined. Combining two trees simply involves setting the father of the root 
of one to the root of the other. 

We leave the actual algorithm and its C implementation for the reader. Forming 
the initial priority queue is O(e log e). Removing the minimum-weight arc and adjusting 
the priority queue is O(lag e). Locating the root of a tree is O(log n). Initial formation 
of the n trees is O(n). Thus, assuming that n < e, as is true of most graphs, Kruskal’s 
algorithm is O(e log e). 


Round-Robin Algorithm 


Still another algorithm, attributable to Tarjan and Cheriton, provides even better 
performance when the number of edges is low. “he algorithm is similar to Kruskal’s 
except that there is a priority queue of arcs associated with each partial tree, rather than 
one global priority queue of all unexamined arcs. 

All partial trees are maintained in a queue, Q. Associated with each partial tree, 7, 
is a priority queue, P(T), of all arcs with exactly one incident node in the tree, ordered by 
the weights of the arcs. Initially, as in Kruskal’s algorithm, each node is a partial tree. A 
priority queue of all arcs incident to nd is created for each node nd, and the single-node 
trees are inserted into Q in arbitrary order. The algorithm proceeds by removing a partial 
tree, T}, from the front of Q; finding the minimum-weight arc a in P(T1); deleting from 
Q the tree, T2, at the other end of arc a; combining T1 and 72 into a single new tree 
T3 [and at the same time combining P(T1) and P(T2), with a deleted, into P(T3)]; and 
adding 73 to the rear of Q. This continues until Q contains a single tree: the minimum 
spanning tree. 

It can be shown that this round-robin al 7>rithm requires only O(e log log n) op- 
erations if an appropriate implementation of the priority queues is used. 


EXERCISES 


8.4.1. Consider the following nonrecursive depth-first traversal algorithm: 


‘or (every node nd) 
visited(i) = FALSE; 
S = a pointer to the starting node of the traversal; 
ndstack = the empty stack; 
while (s '= NULL) { 
push(ndstack,s); 
while (empty(ndstack) == FALSE) { 
x = pop(ndstack); 
if (visited(x) == FALSE) { 
visit(x); 


Exercises 577 


8.4.2. 
8.4.3. 
8.4.4. 
8.4.5. 


8.4.6. 
8.4.7. 


8.4.8. 
8.4.9. 
8.4.10. 


8.4.11. 
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firstsucc(x, yptr,nd); 
while(nd != NULL) { 
if-€visited(nd) == FALSE) 
push(ndstack, nd); 
nextsucc(x, yptr,nd); 
} /* end while */ 
} /* end if */ 
} /* end while */ 
s = select(); 
} /* end while */ 


(a) Apply the algorithm to the graphs of Figures 8.1.3 and 8.4.2a to determine the 
order in which the nodes are visited, if the successors of a node are assumed 
ordered in alphabetical order. 

(b) Draw the spanning trees induced by the traversal on each of the graphs. Modify 
the algorithm to construct a depth-first spanning forest. 

(c) Would a modified ordering of successors produce the spanning trees of Figures 
8.4.1a and 8.4.2b using this algorithm? Would a modified ordering produce the 
spanning trees in (b) using the algorithm of the text? 

(d) Is either algorithm preferable? 

Write an algorithm and a C program to determine if a directed graph is a dag. 

Write a recursive C program to print the nodes of a dag in reverse topological order. 

Write a nonrecursive C program to print the nodes of a dag in reverse topological order 

A node ndin a connected graph is an articulation point if removing nd and all arcs 

adjacent to nd results in an unconnected graph. Thus the “connectedness” of the graph 

depends on nd. A graph with no articulation points is called biconnected. 


(a) Show that the root of the depth-first spanning tree of a biconnected graph has 
only a single son. 


(b) Show that if nd is not the root of a depth-first spanning tree f, nd is an articulation 


point if and only if ¢ does not contain a back edge from a descendant of nd to an 
ancestor of nd. 


(c) Modify the recursive depth-first traversal algorithm to determine if a connected 
graph is biconnecied, 


Write a C routine to create a breadth-first spanning forest of a graph. 


Write C routines that use a breadth-first traversal to determine if a directed and an 
undirected graph are cyclic. f 


Write a C routine to produce the shortest path from node x to node y in an unweighted 
graph, if a path exists, or an indication that no path exists between the two nodes. 


Show that the algorithms to find a cycle using depth-first or breadth-first search must 
be O(n). 


Implement Prim’s algorithm using an adjacency matrix and using adjacency lists and 
a priority queue. 


Implement Kruskal’s algorithm as a C routine. 
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Storage Management 


A programming language that incorporates a large number of data structures must 
contain mechanisms for managing those structures and for controlling how storage is 
assigned to them. The previous chapters of this book illustrated some of those man- 
agement techniques. As data structures become more complex and provide greater 
capabilities to the user, the management techniques grow in complexity as well. 

In this chapter we look at several techniques for implementing dynamic alloca- 
tion and freeing of storage. Most of these methods are used in some form by operating 
systems to grant or deny user program requests. Others are used directly by individual 
language processors. We begin by expanding the concept of a list. 


9.1 GENERAL LISTS 


In Chapter 4 and in Section 7.1 we examined linked lists as a concrete data structure and 
as a method of implementation for such abstract data types as the stack, the queue, the 
priority queue, and the table. In those implementations a list always contained elements 
of the same type. 

It is also possible to view a list as an abstract data type in its own right. As 
an abstract data type, a list is simply a sequence of objects called elements. Associ- 
ated with each list element is a value. We make a very specific distinction between an 
element, which is an object as pari of a list, and the element’s value, which is the object 
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considered individually. For example, the number 5 may appear on a list twice. Each 
appearance is a distinct element of the list, but the values of the two elements—the 
number 5—are the same. An element may be viewed as corresponding to a node in the 
linked list implementation, whereas a value corresponds to the node’s contents. Note 
that the phrase “linked list” refers to the linked implementation of the abstract data type 
“list.” 

There is also no reason to assume that the elements of a list must be of the same 
type. Figure 9.1.1 illustrates a linked list implementation of an abstract list list ‘that 
contains both integers ‘and characters. The elements of that list are 5, 12, ‘s’, 147, and 
‘a’. The pointer list] is called an external pointer to the list (because it is not contained 
within a list node), whereas the other pointers in the list are internal pointers (because 
they are contained within list nodes). We often reference a linked list by an external 
pointer to it. 

It is also not necessary that a list contain only “simple” elements ( for example, 
integers or characters); it is possible for one or more of the elements of a list to them- 
selves be lists. The simplest way to implement a list element e whose value is itself a 
list J, ig by representing the element by a node containing a pointer to the linked list 
implementation of I. 

For example, consider the list list2 of Figure 9.1.2. This list contains four ele- 
ments. Two of these are integers (the first element is the integer 5; the third element 
is the integer 2) and the other two are lists. The list that is the second element of list2 
contains five elements, three of which are integers (the first, second and fifth elements) 
and two of which are lists (the third element is a list containing the integers 14, 9, and 
3 and the fourth element is the null list [the list with no elements]). The fourth element 
of list2 is a list containing the three integers 6, 3, and 10. 

There is a convenient notation for specifying abstract general lists. A list may 
be denoted by a parenthesized enumeration of its elements separated by commas. For 
example, the abstract list represented by Figure 9.1.1 may be denoted by 


list) = (5, 12, ‘s’, 147, ‘a’) 


The null list is denoted by an empty parenthesis pair [such as ()]. Thus the list of Figure 
9.1.2 may be denoted by 


list2 = (5, (3, 2, (14, 9, 3), (), 4). 2, (6, 3, 10) 


We now define a number of abstract operations on lists. For now, we are concerned 
only with the definition and logical properties of the operations, we consider methods 
of implementing the operations later in this section. If list is a nonempty list, head(list) 
is defined as the value of the first element of list. If list is a nonempty list, tail(list) 
is defined as the list obtained by removing the first element of list. If list is the empty list, 


wre fs [fe] 4 


Figure 9.1.1 List of integers and characters. 
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Figure 9.1.2 


head(list) and tail(list) are not defined. For a general list, head(list) may be either a list 
(if the value of the first list element is itself a list) or a simple data item; tail(list) must 


) be a (possibly null) list. 
For example, if list! and /ist2 are as in Figures 9.1.1 and 9.1.2, 


list] = (5, 12, ‘s`, 147, ʻa’) 

head(list!) = 5 

tail(list!) = (12, ‘s`, 147, ʻa’) 
head(tail(listl)) = 12 

tail(iail(list!)) = C's’, 147, ʻa’) 

list2 = (5,(3,2,(14.9,3).0,4),2.(6.3,10)) 
tail(list2) = ((3,2.(14,9,3).0,4),2,(6,3.10)) 
head(tail(list2)) = (3,2,(14,9,3),0.4) 
head(head(tail(list2))) = 3 


The head and tail operations are not defined if their argument is not a list. A sublist 
of a list / is a list that results from the application of zero or more tail operations to l. 

The operation first(list) returns the first element of list list. (If list is empty, 
Jirst(list) is a special null element, which we denote by nullelt.) The operation info(elt) 
returns the value of the list element e/t. The head operation produces a value, whereas 
the first operation produces an element. In fact, head(list) equals info(first(list)). Fi- 
nally, the operation next(elt) returns the element that follows the element elt on its list. 
This definition presupposes that an element can have only one follower. 

The operation nodetype(elt) accepts a'list element elt and returns an indication of 
the type of the element's value. Recall that, in the linked list implementation, an element 
is represented by a node. Thus if the enumerated constants ch, intgr, and [st represent 
the types character, integer, and list, respectively, nodetvpe(first(list])) equals intgr, 
nodetype(first(tail(tail(list})))) equals ch, and nodetype(first(tail(list2))) equals Ist. 
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Operations That Modify a List 


head, tail, first, info, next, and nodetype extract information from lists already in 
existence. We now consider operations that build and modify lists. 

Recall the push operation of Chapter 2 and its list implementation in Section 4.2. 
If list points to a list, the operation push(list, x) adds an element with value x to the front 
of the list. To illustrate the use of the push operation in constructing lists, consider the 
list (5,10,8), which can be constructed by the operations: 


list = nul]; 
push(list,8); 
push(Tist,10); 
push(list,5); 


Note that the abstract push operation changes the value of its first parameter to 
the newly created list. We introduce as a new operation the function 


addon(list, x) 


which returns a new list that has x as its head and Jisr as its tail. For example, if 
11 = (3,4,7), the operation 


12 = addon(11,5); 


creates a new list /2 equal to (5,3,4,7). The crucial difference between push and addon _ 
is that push changes the value of its first parameter, while addon does not. Thus, in the 
foregoing example, l1 retains the value (3,4,7). The operation push(list, x) is equivalent 
to list = addon(list, x). Since addon is more flexible than push, and since push is usually ` 
used only in connection with stacks, we henceforth use addon exclusively. 

Two other operations used to modify lists are setinfo and setnext. Setinfo(elt, x) 
changes the value of a list element elt to the value x. Thus we may write 
setinfo(first(list),x) to reset the value of the first element of the list list to x. This 
operation is often abbreviated sethead(list, x). For example, if list equals (5,10,8), the 
operation 


sethead( list, 18) 


changes list to (18, 10, 8), and the operation sethead(list,(5,7,3,4)) changes list to 
((5,7.3,4), 10,8). 

sethead is called the “inverse head operation” for an obvious reason. After per- 
forming the operation sethead(list, x), the value of head(list) is x. Note that 
sethead (list, x) is equivalent to 


list = addon(tail(list),x); 
The operation semnext(elt], elt2) is somewhat more complex. It modifies the list 


containing e/tl, so that next(eltl) = elt2. elti cannoy be the null element. next(elt2) 
is unchanged. Also if next(elt3) had been equal to elt2 before execution of 
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setnext(eltl,elt2), next(elt3) still equals elt2 after its execution, so that both next(eltl) 
and next(elt3) equal elt2. In effect, elt2 has become an element of two lists. The oper- 
ation settail(list|,list2) is defined as setnext(first(list1), first(list2)) and sets the tail of 
list! to list2. settail is sometimes called the inverse tail operation. - 

For example, if list = (5,9,3,7,8,6), settail(list, (8)) changes the value of list to 
(5,8), and settail(list, (4,2,7)) changes its value to (5,4,2,7). Note that the operation 
settail (list, I) is equivalent to list = addon(l, head(list)). i 


! 


Examples 


Let us look at some simple examples of algorithms that use these operations. 
The first example is an algorithm to add 1 to every integer that is an element of a 
list list. Character or list elements remain unchanged. 


p = first(list); 
while (p != nullelt) { 
if (nodetype(p) == INTGR) 
setinfo(p, info(p) + 1); 
p = next(p); 
} /* end while */ 


. The second example involves deletions. We wish to delete from alist list any char- 
acter element whose value is ‘w’. (Compare this example with the routine of Section 
4.2). One possible solution is as follows: 


q = nullelt; 
p = first(list); 
while (p != nullelt) 
if (info(p) == 'w') { 
/* remove node(p) from the list */ 
p = next(p); 
if (q == nullelt) 
list = tail(list); 
else 
setnext(q,p); 
} -/* end if */ 
else { 
q= pi 
p = next(p); 
} /* end else */ 


Before looking at a more complex example, we define a new term. An element 
(or a node) n is accessible from a list (or an external pointer) / if there is a sequence of 
heat and tail operations that, if applied to J, yields-a list with n as its first element. For 
example, in Figure 9.1.2 the node containing 14 is accessible from /ist2, since it is the 
first element of tail(tail(head(tail(list2)))). In fact, all the nodes shown in that figure 
are accessible from /ist2. When a node is removed from a list, it becomes inaccessible 
from the external pointer to that list. 
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Now suppose that we wish to increase by | the value in every integer node ac- 
cessible from a given list pointer list. We cannot simply traverse list, since it is also 
necessary to traverse all lists that are elements of list, as well as all lists that may be el- 
ements of elements of list, and so forth. One tentative solution is the following recursive 


algorithm addone2(list). 


p = first(list); 
while (p != nullelt) { 
if (nodetype(p) == INTGR) 
setinfo(p, info(p) + 1); 
else 
if (nodetype(p) == Ist) 
addone2(info(p)); 


p = next(p); 
} /* end while */ 


It is simple to remove the recursion and use a stack explicitly. 
Linked List Representation of a List 


As previously noted, the abstract concept of a list is usually implemented by a 
linked list of nodes. Each element of the abstract list corresponds to a node of the linked 
list. Each node contains fields info and next, whose contents correspond to the abstract 
list operations info and next. The abstract concepts of a “list” and an “element” are 
both represented by a pointer: a list by an external pointer to the first node of a linked 
list and an element by a pointer to a node. Thus, a pointer to a node nd in a list, which 
represents a list element, also represents the sublist formed by the elements represented 
by the nodes from nd to the end of the list. The value of an element corresponds to the 
contents of the info field of a node. 

Under this implementation the abstract operation first(list), which returns the 
first element of a list, is meaningless. If list is a pointer that represents a list, it 
points to the first node of a linked list. That pointer therefore also represents the 
first element of the list. Since first(list) and list are equivalent, head(list), which 
is equivalent to info(first(list)), is equivalent to info(list). sethead(list,x), defined 
as setinfo(first(list),x), is equivalent to setinfo(list.x). Similarly, settait(list1 ,list2), 
defined as setnext(first(list1),first(list2)), is equivalent to setnext(list] list2) under the 


linked list representation., — 
The operation sefinfo(eltx) is implemented by the assignment: statement 


info(elt) = x, where elt is a pointer to a node, and the operation setnext(cltl,elt2) 
by the assignment next(eltl) = elt2. Since a list is represented by a pointer to its first 
node, an element of a list that is itself a list is represented by a pointer to the list in the 
info field of the node representing the element. 

We defer a discussion of the implementation of nodetype until we present the C 
implementation of general lists later in this section. 

There are two methods of implementing the addon and tail operations. Con- 
sider the list /1 = (3,4,7) and the operation /2 = addon(I1,5). The two possible ways 
of implementing this operation are illustrated in Figure 9.1 .3a and b. Inthe first method, 
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(b) The Copy Method 


Figure 9.1.3 


a 

called the pointer method, the list (3,4,7) is represented by a pointer to it, /1. To create 
the list /2, a node containing 5 is allocated and the value of /1 is placed in its next field. 
Thus the list /1 becomes a sublist of /2. The nodes of list /I are used in two contexts: 
as part of list /1 and of list /2. In the second method, called the copy method, the list 
(3,4,7) is copied before the new list element is added to it. /1 still points to the original 
version, and the new copy is made a sublist of /2. The copy method ensures that a node 
appears in only one context. 

The difference between these methods becomes apparent when we attempt to 
perform the operation 


sethead(11,7) 


The resulting lists are shown in Figure 9.1.4a and b. If the copy method is used, a change 
in list /1 does not affect list /2 (Figures 9.1.3b and 9.1.4b). If the pointer method is used, 
any subsequent change in list ^} also modifies /2 (Figures 9.1.3a and 9.1.4a). 

The tail operation can also be implemented by either the pointer method or the 
copy method. Under the pointer method, tai/(/ist) returns a pointer to the second node 
in the input list. After a statement such as / = tail(/ist), the pointer list still points to the 
first node, and all nodes from the second on are on both list list and list /. Under a strict 
copy method, a new list would be created containing copies of all nodes from the second 
list node onward, and / would point to that new list. Here, too, if the pointer method is 
used, then a subsequent change in either the input list (ist) or the output list (/) causes a 
change in the other list. If the copy method is used, the two lists are independent. Note 
that, under the pointer method, the operation fail(list) is equivalent to next(/ist): both 
return the pointer in the next field of the node pointed to by the pointer /ist. Under the 
copy method however, an entirely new list is created by the sail operation. 
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Figure 9.1.4 


The operation of the copy method is similar to an operation of the assignment 
statement a = b so'that a subsequent change in b does not change a. This is because the 
assignment statement copies the contents of location b (the “value of b”) into location 
a. A change in the value of b changes only the copy in location b. Similarly, in the copy 
method, although /1 is a pointer, it really refers to the abstract list being represented. 
When a new list is formed from the old, the value of the old list is copied. The two lists 
are then entirely independent. In the pointer method, /1 refers to the nodes themselves 
rather than the list that they collectively represent. A change in one list modifies the 
contents of nodes that are ätso part of another. 

For reasons of efficiency, most list processing systems use the pointer method 
rather than the copy method. Imagine a 100-element list to which nodes ure constantly 
being added (using addon) and from which nodes are being deleted (using tail). The 
overhead involved in both time and space in allocating and copying 100 list nodes (not 
to mention any list nodes on lists that appear as elements) each time that an operation 
is performed is prohibitive. Under the pointer method, the number of operations in- 
volved in adding or deleting an element is independent of the list size, since it involves 
only modification of a few pointers. However, in exchange for this efficiency, the user 
must be aware of possible changes to other lists. When the pointer method is used, it is 
common for list nodes to be used in more than one context. 

In list processing systems that use the pointer method, an explicit copy operation 
is provided. The function 


copy(list) 


copies the list pointed to by ist (including all list elements) and returns a pointer to the 
new copy. The user can use this operation to ensure that a subsequent modification to 
one list does not affect another. 
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Representation of Lists 


So far we have ignored a number of important implementation questions: When 
may list nodes be freed? When must new list nodes be allocated? How are list nodes 
allocated and freed? For example, settail appends a new list to the first element of a list. 

; But what happens to the previous tail of the list that was replaced? 

These questions are related to another question: What happens when a node (or 
a list) is an element ora sublist of more than one list? For example, suppose the list 
(4,5,3,8) occurs twice as an element of a list (that is, it is the information fìeld of two 
separate nodes of the list). One possibility is to maintain two copies of the list, as in 
Figure 9.1.5u..Or suppose that a list appears at the end of two lists as does (43,28) in 

+ Figure 9.1.5b. Although it is possible to duplicate each element whenever it appears, 
this often results in a needless waste of space. An alternative is to maintain the lists 
as in Figure 9.1.6, In this representation, a list appears only once, with all appropriate 
pointers pointing to the head of the single list. Under this method, a node is pointed to 
by more than one pointer. 

If this possibility is allowed, the recursive algorithm addone2 presented earlier 
to add I to each accessible element in a list does not work correctly. For example, if 
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Figure 9.1.5 
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Figure 9.1.6 


addone? is applied to list6 in Figure 9.1.6a, when 1 is added to the integer 4 as the first 
element of the second element of list6, the contents of that node are changed to 5. But 
the routine again adds 1 to that node, since the node is pointed to by the informatio 
field of another list element as well. Thus the fina! value in the node becomes 6 rather 
than 5. Similarly, the values of the nodes containing 5, 3, and 8 are changed to 7, 5, and 
10, respectively (why?). This is clearly incorrect. 

In general, modifying the contents of a node from 4 to 5 is equivalent to replacing 
the node containing 4 by a new node containing 5 and then freeing the node containing 
4. But this assumes, possibly erroneously, that the node containing 4 is no longer needed. 
Whenever the contents of a node are changed or a node is deleted, it is first necessary 
to ensure that the old value is no longer required. 

In Figure 9.1.6b, the list (43,28) appears as a sublist of both /ist7, which is 
(12,18,43,28), and list8, which is (47,59,16,43,28). Imagine the chaos that would 
result if an attempt were made to remove the third element of list7. 

One solution to this problem is to disallow use of the same node in more than one 
context. That is, lists should be constructed as in Figure 9.1.Sa rather than as in Figure 
9.1.6a. Then when a node is no longer needed in a particular context, it can be freed, 
since no other internal pointers point to it. 


crlist Operation 
Suppose that we want to create the list of Figure 9.1.6a. The following sequence 


of operations accomplishes this: 
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1 = null; 

] = addon(1,8); 

] = addon(1,3); 

1 = addon(1,5); 

1 = addon(1,4); 

list6 = null; 

list6 = addon(list6,5); 
list6 = addon(list6, 1); 
list6 = addon(list6,4); 
list6 = addon(list6, 1); 
list6 = addon(Tist6,5); 


Let us introduce the operation | = crlist(a,a2,...s4n), where each parameter is either a 
simple data item or a list pointer. This operation is defined as the sequence of statements: 


1 = null; à 
] = addon(l,an); 


1 = addon(1,a}); 

] = addon(1,a4); 
That is, crlist(ay,a2, ...,dn) creates the list (a1 42,- -- „an). Then the foregoing sequence 
of operations can be rewritten as l 


] = erlist(4,5,3,8); 
list6 = crlist(5,1,4,1,5)4 


Notice that this is not the same as the single operation 
listé = (5,crlist(4,5,3,8).4,crlist(4,5,3,8),5); 


which creates two distinct copies of the list (4,5,3,8): one as its second element and one 
as its fourth. 

We leave as an exercise for the reader the task of finding a sequence of list oper- 
ations that creates the lists /ist7 and list8 of Figure 9.1.6b. 

If the pointer method is used to implement list operations, it is possible to cre- 
ate recursive lists. These are lists that contain themselves as elements. For example, 
suppose the following operations are performed: 


1 = erlist(2,crlist(9,7),6,4); 
11 » tai7?(tail(1)); 
sethead(11,1); 

12 = head(tail(1)); 

12 = tail(12); 

sethead(12,1); 


Figure 9.1.7 illustrates the effect of each of these operations. At the end of the sequence 
(Figure 9.1.7e) the list Z contains itself as its third element. In addition, the second 
element of / is a list whose second element is / itself. 
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sethead (11, 1) 
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12 = head (tail (1)) 
Figure 9.1.7 
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12 = tail (12) 


(d) 


sethead (12, l) 


(e) 


Figure 9.1.7 (cont.) 


Use of List Headers 


In Chapter 4 list headers were introduced as a place to store global information 
about an entire list. In many general list Processing systems, header nodes are used for 
other purposes as well. We have already seen two ways of implementing general lists: 
the pointer method and the copy method. There is a third alternative, called the header 
method, that is widely used in list Processing systems. Under this method a header 
node is always placed at the beginning of any group of nodes that is to be consid- 
ered a list. In particular, an external pointer always points to a header node. Similarly, 
if a list /is an element of another list, there is a header node at the front of /. Figure 9.1.8 
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illustrates the list of Figure 9.1.2 using the header method. The- information portion ot 
a header node holds global information about the list (such as the number of nodes in 
it, or a pointer to its last node). In the figure this field is shaded. Note that a null list is 
now represented by a pointer to a header node containing a null pointer in its next field, 
rather than by the null pointer itself. 
Any parameter that represents a list must be implemented as a pointer to a header 

node for that list. Any function that returns a list must be implemented SO as to return a 
pointer to a header node. 

_ The header method is similar to the pointer method in that a list is represented by 
a pointer to it. However, the presence of the header nodé causes significant differences 
(as we noted earlier in Section 4.5 when we discussed linear, circular, and doubly linked 
lists with headers). For example, we made a distinction between the push and the addon 
operations. If / is a list pointer, the function addon(/.x) adds a node containing x to the 
list pointed to by /, without changing the value of /, and returns a pointer to the new 
node. push(/,x) changes the value of its parameter / to point to the new node. Under 
the header method, adding an element to a list involves inserting a node between the 
header and the first list node. Thus despite the fact that the value of / is not changed, 
the list that / represents has been altered. 


Freeing List Nodes 


Earlier in this section we saw that a node or a set of nodes could be an element 
. and/or a sublist of one or several lists. In such cases there is difficulty in determining 

when such a node can be modified or freed. Define a simple node as a node containing 
a simple data item (so that its info field does not contain a pointer). Generally, multiple 
use of simple nodes is not permitted. That is, operations on simple nodes are performed 
by the copy method rather than the pointer method. Thus any simple node deleted from 
a list can be freed immediately. 

However, the copy method is highly inefficient when applied to nodes whose 
values are lists. The pointer method is the more commonly used technique when ma- 
nipulating such nodes, Thus whenever a list is modified or deleted as an element or 
a sublist, it is necessary to consider the implications of the modification or freeing of 
the list on other lists that may contain it. The question of how to free a deleted list is 
compounded by the fact that lists may contain other lists as elements. If a particular list 
is freed, it may also be necessary to free all the lists that are elements of it; however if 
these lists are also elements of other lists, they cannot be freed. 

As an illustration of the complexity of the problem, consider /ist9 of Figure 9.1.9. 
The nodes in that figure are numbered arbitrarily so that we may refer to them easily in 
the text. 

Consider the operation 


list9 = null; 
Which nodes can be freed and which’must be retained? Clearly, the list nodes of Jist9 


(nodes 1,2,3,4} can be freed, since no other pointers reference them. Freeing node |! 
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Figure 9,1.9 


allows us to free nodes 11 and 12, since they too are accessed by no other pointers. 
Once node 11 is freed, can nodes 7 and 8 also be freed? Node 7 can be freed because 
cach of the nodes containing a Pointer to it (nudes 11 and 4) can be freed. However, 
node 8 cannot be freed, since list! 1 points to it. Jisrl} is an external pointer; therefore 
the node to which it points may still be needed elsewhere in the program. Since node 8 
is kept, nodes 9 and 10 must also be kept (even though node 12 is being freed). Finally, 
nodes 5 and 6 must be kept because of the external pointer list10. 

The problem to be addressed in the next section is how to determine algorithmi- 
cally which nodes are to be kept and which are to be freed. However, before considering 
possible solutions, let us consider how lists can be implemented in C and make some 
comments about list processing languages and their design, 


General Lists in C 


Because general list nodes can contain simple data elements of any type or point- 
ers to other lists, the most direct way to declare list nodes is by using unions. One 
possible implementation is as follows: 


fdefine INTGR t 
#define CH 2 
#define LST 3 
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struct nodetype { 


int utype; /* utype equals INTGR, CH, or LST */ 

union { 
int intgrinfo; /* utype =. INTGR */ 
char charinfo; /* utype= CH  +*/ 
struct nodetype *Istinfo; /* utype = LST */ 

} info; 

struct nodetype *next; 

HH 


typedef’ struct nodetype *NODEPTR; 


Each list node had three elementary fields: a flag (utype) to indicate the type of the 
information field, the actual information field and a pointer to the next node in the list. 
The operation nodetype( p) is implemented by simply referencing p — > utype. 

` The actual implementation of any list operation depends on whether the system 
in question is implemented using the pointer method, *he copy method, or the header 
method. We consider the pointer method. 

The fail operation always produces a pointer to a list (possibly the null pointer), 
assuming that its argument points to a valid list. Thus this operation may be imple- 
mented as a simple C function: 


NODEPTR tail(NODEPTR list) 


if (list == NULL) { 
printf("illegal tail operation"); 
exit(1); 
} 
else 
return(list->next); 
} /* end tail */ 


We might be tempted to implement the head operation in similar fashion, by sim- 
ply returning the value in the info field of the node to which the parameter points. 
However, because some versions of C do not allow a function to return a structure, we 
implement the head operation as a function with two parameters: an input parameter 
that holds a pointer to the input list and an output parameter that points to a structure 
containing the information. Let us assume that a structure has been declared 


struct { 
int utype; 
union { . 
int intgrinfo; 
char charinfo; 
struct nodetype *listinfo; 
} info; 
} infotype; 
typedef struct infotype *INFOPTR; 
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\ 
Then the function head, which is invoked by a statement such as head(listl. 
&item), is implemented as follows: 


void head(NODEPTR list, INFOPTR pitem) 
{ 


if (list == NULL) { 
printf ("illegal head operation"); 
exit(1); 
} /* end if */ 
pitem = (INFOPTR) malloc(sizeof (struct infotype)); 
pitem->utype = list->utype; 
pitem->info = list->info; 
return; 
} /* end head */ 


In some applications it may not be necessary to return the value of the contents 
of the information portion of the node; it may be sufficient to identify a pointer to the 
desired node. In such a case the value of the pointer variable traversing the list may be 
used instead of the head routine. Note that neither the head nor tail operations change 
the original list in any way. All fields retain the same values that they ha: before the 
routines were called. 

Once the basic forms of head and tail have been implemented, other list opera- 
tions can be implemented either in terms of these operations or by accessing list nodes 
directly. For example, the addon operation may be implemented as follows: 


NODEPTR addon(NODEPTR list, struct infotype *pitem) 

{ : 
NODEPTR newptr; 
newptr = (INFOPTR) malloc (sizeof (struct nodetype)); 
newptr->utype = pitem->utype; 
newptr->info =. pitem->info; 
newptr->next = list; 
return (newptr); 

} /* end addon */ 


Now that addon has been implemented, sethead(list, item) can be implemented by the 
statement list = addon(tail (list), &item), as we mentioned earlicr. Alternatively, the 
sethead operation can be implemented directly by the sequence of statements: 


list->utype = item.utype; 
list->info = item. info; 


The seztail operation may be implemented similarly as follows, using an auxiliary 
routine freelist to free the previous tail of list: 


void settail (NODEPTR *plist, NODEPTR t1) 
{ 
NODEPTR p; 
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p = *plist->next; 
*plist->next = tl; 
freelist(p); 

} /* end settail */ 


Despite the fact that we are using the pointer method, more likely than not, there is no 
need for the previous tail of list. If some other pointer points to this portion of the list 
then the call to freelist should be omitted. 

The implementation of séttail highlights the problem of automatic list manage- 
ment and how to determine when a node should be freed if it may indeed appear in 
more than one context (as in the pointer method or if recursive lists are permitted). As 
mentioned, we examine these issues in Section 9.2. 


Programming Languages and Lists 


Throughout this text we have been treating a list as a compound data structure (a 
collection of nodes), rather than as a native data type (an elementary item such as int 
and char). The reason for this is that we have been working closely with the C language. 
In C, one cannot make a declaration such as 


list x; 


and apply such functions as head and tail to x directly. Rather, the programmer must 
implement lists by writing the necessary procedures and functions for their manipula- 
tion. Other languages, however, do contain lists as elementary data structures with the 
operations crlist, head, tail, addon, sethead, and settail already built into the language. 
(A good example of such a language is LISP.) 

One consequence of the fact that C does not include list manipulation capabi'ities 
is that if a programmer programs a list manipulation application, it is the programmer's 
responsibility to allocate and free the necessary list nodes. As we have seen in this sec- 
tion, that problem is not at all trivial if lists are allowed in all their generality. However, 
any given application can usually be designed more easily using a specific type of list, 
tree or graph, as we have seen in Chapters 4, 5, and 8. Indeed, general list manipula- 
tion techniques are more expensive in terms of both time and space than techniques 
designed specifically for a particular application. (This is a corollary to the axiom that 
a price is always paid for generality.) Thus the C programmer will rarely have occasion 
to use general list manipulation techniques. 

However, a general list processing system, in which the list is a native data type 
and list operations are built-in, must be able to deal with lists in all their generality. 
Since the fundamental objects are lists and data items rather than nodes, the program- 
mer cannot be responsible for allocating and freeing individual nodes. Rather, when a 
program issues a statement such as 


11 = crlist(3,4,7); 


the system is responsible for allocating sufficient list nodes and initializing the proper 
pointers, When the program later issues the command 
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11 = NULL; 


the system is responsible for identifying and freeing those nodes previously on list /1 
that now become inaccessible. If such nodes are not freed, available space would rapidly 
become exhausted. 

In sorne sense, languages that include lists as native data types are of “higher 
level” than C because the programmer is freed from so much of the bookkeeping activity 
associated with storage management. C-may be thought of as a language of higher 
level than FORTRAN, in that C includes data structures such as structures and unions, 
whereas FORTRAN does not. So too, a list processing system is of higher level than C 
in that it includes lists, whereas C does not. 

Another point that should be made concerns the implementation of lists. The im- 
plementation of lists as presented in this section is oriented toward C. Because C per- 
mits the use of unions, it was possible to define a type infotype to contain any of the 
legal data types in our list system. Some languages (for example, PL/!) do not support 
unions. In such languages, the type of a ncde (with certain limited exceptions) is fixed 
in advance. In such languages it would be necessary to separate a list system into list 
nodes and atomic nodes. An atomic node is a node that contains no pointers—only 
a simple data item. Several different types of atomic nodes would exist, each with a 
single data item corresponding to one of the legal data types. A list node contains a 
pointer to an atomic node and a type indicator indicating the type of atomic node to 
which it points (as well as a pointer ts the next node on the list, of course). When it is 
necessary to place a new node on a list, an atomic node of the appropriate type must 
be allocated, its value must be assigned, the list node information’ field must be set 
to point to the new atomic node, and the type field in the list node must be set to the 
proper type. 

To understand how clumsy this situation is, suppose that there are ten different 
types of atomic nodes (there is no reason that an atomic node may rot be an array or 
a stack or a queue or a program label, for example). Each of these must have a unique 
typecode. Further there must be a separate variable declared for each type of atomic 
node. Let us suppose that the typecodes used for the ten types are fl, f2,..., 110 and 
that the atomic node variables are nodel, node2,..., node10. Then each time that an 
atomic node is processed, we would need code such as 


switch (typecode) { 
case tl: /*do something with nodel*/ 
case t2: /*do something with node2*/ 


case tl0: /*do something with nodel0*/ 
} /* end switch */ 


This is a cumbersome organization, and one which we are able to avoid by using unions. 
In the next section of this chapter we examine techniques incorporated into list 
processing systems to recover storage that is no ionger needed. We retain the list 


Structure conventions of this section. but it should be understood that they are not 
absolute. 


Storage Management Chap. 9 
298 


` 


EXERCISES 


9.1.1. How would you implement a general stack and queue in C? Write all the routines 
necessary for doing so. > 

9.1.2. Implement the routines addon, sethead, settail, and crlist in C. 

9.1.3. Write a C subroutine freelist(list) that frees all nodes accessible from a pointer list. If 
your solution is recursive, rewrite it nonrecursively. : 

9.1.4. Rewrite addone2 so that it is nonrecursive. 

9.1.5. Write a C routine dir(list, n) that deletes the nth element of a list. If this nth element 
is itself a list, all nodes accessible through that list should be freed. Assume that a list 
can appear in only one position. 

9.1.6. Implement the function copy(list in C. This routine ‘accepts a pointer list to a general 
list and returns a pointer to a copy of that list. What if the list is recursive? 

9.1.7. Write a C routine that accepts a list pointer and prints the parenthesized notation for 
that list. Assume that list nodes can appear only on a single list, and that recursive lists 
are prohibited. 

9.1.8. What are the advantages and disadvantages of languages in which the type of variables 
need not be declared. as compared with languages such as C? 

9.1.9. Write two sets of list operations to create the lists of Figures 9.1.4b and 9.1.9. 

9.1.10. Redraw all the lists of this section that do not include header nodes so that they are now | 
included. 

9.1.11. Implement the routines addon, head, tail, sethead, and settail in C for lists using the 
following methods. 
(a) Copy method 
(b) Header method 

9.1.12. Implement the list operations for a system that uses doubly linked lists. 


y.2 AUTOMATIC LIST MANAGEMENT 


In the last section we presented the necd for algorithms to determine when a given 
list node is no longer accessible. In this section we investigate such algorithms. The 
philosophy behind incorporating such an algorithm into a programming system is that 
the programmer should not have to decide when a node should be allocated or freed. 
Instead, the programmer should code the solution to the prolem with the assurance 
that the system will automatically allocate any list nodes that are necessary for the lists 
being created and that the system will make available for reuse any nodes that are no 
longer accessible. 

There are two principal methods ised in automatic list management: the refer- 
ence count method and the garbage collection method. We proceed to a discussion of 
each. 


Reference Count Method 


Under this method each node has an additional count field that keeps a count 
(called the reference count) of the number of pointers (both internal and external) to 


that node. Each time that the value of some pointer is set to point to a node, the reference 
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count in that node is increased by 1; each time that the value of some pointer that had 
been pointing to a node is changed, the reference count in that node is decreased by 


Each list operation of a system using the reference count method must make pro- 
vision for updating the reference count of each node that it accesses and for freeing any 
node whose count becomes 0. For example, to execute the statement 


l = tail(1); 
the following operations must be performed: 


p=; 

] = next(1); 
next(p) = null; 
reduce(p); 


where the operation reduce(p) is defined recursively as follows: 


if (p != nul) { 
ccunt(p)--; 
if (count(p) == 0) { 
r = next(p); 
reduce(r); 
if (nodetype(p) == Ist) 
reduce(head(p)); 
. free node(p); 
} /* end if */ 
} /* end if */ 


reduce must be invoked whenever the value of a pointer to a list node is changed. Sim- 
ilarly, whenever a pointer variable is set to point to a list node, the count field of that 
node must be increased by 1. The coun? field of a free node is 0. 

To illustrate the reference count method, consider again the list of Figure 9.1.9, 
The following set of statements creates that list: 


~ 
listl0 = crlist(14,28); 
listll = crlist(crlist(5,7)); 
11 = saddon(Tist11, 42); 
m = crlist(71,head(listl1)); 
list9 = crlist(m, 1ist10,12, 11); 
m= null; 
11 = null; 


Figure 9.2.1 illustrates the creation of the list using the reference count method. Each 
part of that figure shows the list after an additional group of the foregoing statements 
has been executed. The reference count is shown as the leftmost field of each list 
node. Each node in that figure is numbered according to the numbering of the nodes in 
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Figure 9.2.1 (cont.) 


Figure 9.1.9. Make sure that you understand how each statement alters the reference 


count in each node. 


Let us now see what happens when we execute the statement 


list9 = null; 


The results are illustrated in Figure 9.2.2, where freed nodes are illustrated using dashed 
lines. The following sequence of events may take place: 


(Figure 9.2.2a) 


(Figure 9.2.2b) 


count òf node | is set to 0. 

Node | is freed. 

counts of nodes 2 and 1] are set to 0. 
Nodes 2 and 11 are freed. 

counts of nodes 5 and 7 are set to 1. 
counts of nodes 3 and 12 are set to 0. 
Nodes 3 and 12 are freed. 

count of node 4 is set to 0. 

Node 4 is freed. 

count of node 9 is set to 1. 

count of node 7 is set to 0. 

Node 7 is freed. 

count of node 8 is set to 1. 


Only those nodes accessible from the external pointers listl0 and /is111 remain allo- 


cated: all others are freed. 
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One drawback of the reference Count method is illustrated by the foregoing ex- 
ample. The amount of work that must be performed by the system each time that a 
list manipulation statement is executed can be considerable. Whenever a pointer value 
is changed, all nodes previously accessible from that pointer can potentially be freed. 
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Often, the work involved in identifying the nodes to be freed is not worth the reclaimed 
space, since there may be ample space for the program to run to completion without 
reusing any nodes. After the program has terminated, a single pass reclaims all of its 
storage without having to worry about reference count values. 


One solution to this problem can be illustrated by a different approach to the pre- 
vious example. When the statement 


list9 = null 


is executed, the reference count in node 1 is reduced to 0 and node | is freed—that is, 
it is placed on the available list. However, the fields of this node retain their original 
values, so that it still points to nodes 2 and 11. (This means that an additional pointer 
field is necessary to link such nodes on the available list. An alternative is to reuse 
the reference count field for this purpose.) The reference count values in these two 
nodes remain unchanged. When additional space is needed and node | is reallocated 
for some other use, the reference counts in nodes 2 and 11 are reduced to 0 and they 
are then placed on the available list. This removes much of the work from the deallo- 
cation’ process and adds it to the allocation process. If node 1 is never reused because 
enough space is available, nodes 2, 11, 3, 4, 7, and 12 are not freed during program 
execution. For this scheme to work best, however, the available list should be kept as 
a queue rather than as a stack, so that freed nodes are never allocated before nodes 
that have not been used for the first time. (Of course, once a system has been running 
for some time so that all nodes-have been used at least once, this advantage no longer 
exists.) 

There are two additional disadvantages to the reference count method. The first is 
the additional space required in each node for the count. This is not usually an overriding 
consideration, however. The problem can be somewhat alleviated if each list is required 
to contain a header node and a reference count is kept only in the header. However, then 
only a header node could be referenced by more than one pointer (that is, a list such as 
in Figure 9.2.3b would be prohibited). The lists of Figure 9.2.3 are analogous to those 
of Figure 9.1.6 except that they include header nodes. The counts are kept in the first 
field of the header node. When the count in a header node reaches 0, all the nodes on its 
list are freed and the counts in-header nodes pointed to by /stinfo fields in the list nodes 
are reduced. i 

If counts are to be retained in header nodes only, certain operations may have to 
be modified. For example, the settail operation must be modified so that the situation of 
Figure 9.2.3b does not occur. One method of modification is to use the copy method in 
implementing these operations. Another method is to differentiate somehow between 
external pointers, which represent lists (and therefore must point to a header node), and 
“temporary” external pointers, which are used for traversal (and can point directly to 
list nodes). When the count in a header node becomes 0, references to its list nodes 
through temporary pointers become illegal. 

The other disadvantage of the reference count method is that the count in the first ° 
node of a recursive or circular list will never be reduced to 0. Of course, whenever a 
pointer within a list is set to point to a node on that list, the reference count can be 
maintained rather than increased, but detecting when this is so is often a difficult task. 
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Garbage Collection 


Under the reference count method, nodes are reclaimed when they become avail- 
able for reuse (or under one version when they are needed). The other principal method 
of detecting and reclaiming free nodes is called garbage collection. Under this method, 
nodes no longer in use remain allocated and undetected until all available storage has 
been allocated. A subsequent request for allocation cannot be satisfied until nodes that 
had been allocated but are no longer in use are recovered. When a request is made for 
additional nodes and there are none available, a system routine called the garbage col- 
lector is called. This routine searches through all of the nodes in the system, identifies 
those that are no longer accessible from an external pointer, and restores the inaccessi- 
ble nodes to the available pool. The request for additional nodes is then fulfilled with 
some of the reclaimed nodes and the system continues processing user requests for more 
space. When available space is used up again, the garbage collector is called once more. 

Garbage collection is usually done in two phases. The first phase, called the mark- 
ing phase, involves marking all nodes that are accessible from an external pointer. The 
second phase, called the collection phase, involves proceeding sequentially through 
memory and freeing all nodes that have not been marked. We examine the marking 
phase first and then turn our attention to the collection phase. 

One field must be set aside in each node to indicate whether a node has or has not 
been marked. The marking phase sets the mark field to true in each accessible node. As 
the collection phase proceeds, the mark field in each accessible node is reset to false. 
Thus, at the start and end of garbage collection, all mark fields are false. User programs 
do not affect the mark fields. 
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It is sometimes inconvenient to reserve one field in each node solely for the pur- 
pose of marking. In that case a separate area in memory can be reserved to hold a Jong 
array of mark bits, one bit for each node that may be allocated. 

One aspect of garbage collection is that it must run when there is very little space 
available. This means that auxiliary tables and stacks needed by the garbage collector 
must be kept to a minimum since there is little space available for them. An alternative 
is to reserve a specific percentage of memory for the exclusive use of the garbage col- 
lector. However, this effectively reduces the amount of memory available to the user 
and means that the garbage collector will be called more frequently. 

Whenever the garbage collector is called, all user processing comes to a halt while 
the algorithm examines all allocated nodes in memory. For this reason it is desirable that 
the garbage collector be called as infrequently as possible. For real-time applications, 
in which a computer must respond to a user request within a specific short time span, 
garbage collection has generally been considered an unsatisfactory method of storage 
management. We can picture a spaceship drifting off into the infinite as it waits for 
directions from a computer occupied with garbage collection. However, methods have 
recently been developed whereby garbage collection can be performed simultaneously 
with user processing. This means that the garbage collector must be called before all 
space has been exhausted so that user processing can continue in whatever space is left, 
while the garbage collector recovers additional space. 

Another important consideration is that users must be careful to ensure that all 
lists are well formed and that all pointers are correct. Usually, the operations of a list 
processing system are carefully implemented so that if garbage collection does occur in 
the middie of one of them, the entire system still works correctly. However, some users 
try to outsmart the system and implement their own pointer manipulations. This requires 
great care so that garbage collection will work properly. In a real-time garbage collection 
system, we must ensure not only that user operations do not upset list structures that the 
garbage collector must have but also that the garbage collection algorithm itself does 
not unduly disturb the list structures that are being used concurrently by the user. As 
we shall see, some marking algorithms do disturb (temporarily) list structures and are 
therefore unsuitable for real-time use. 

It is possible that, at the time the garbage collection program is called, users are 
actually using almost all the nodes that are allocated. Thus almost all nodes are acces- 
sible and the garbage collector recovers very little additional space. After the system 
runs for a short time, it will again be out of space; the garbage collector will again be 
called only to recover very few additional nodes, and the vicious cycle starts again. This 
phenomenon, in which system storage management routines such as garbage collection 
are executing almost all the time, is called thrashing. 

Clearly, thrashing is a situation to be avoided. One drastic solution is to impose 
the following condition. If the garbage collector is run and does not recover a specific 
percentage of the total space, the user who requested the extra space is terminated and 
removed from the system. All of that user’s space is then recovered and made available 
to other users. 

Algorithms for Garbage Collection 


The simplest method for marking all accessible nodes is to mark initially all nodes 
that are immediately accessible (that is, those pointed to by external pointers) and then 
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repeatedly pass through all of memory sequentially. On each sequential pass, whenever 
a marked node nd is encountered, all nodes pointed to by a pointer within nd are marked. 
These sequential passes continue until no new nodes have been marked in an entire pass. 
Unfortunately, this methud is as inefficient as it is simple. The number of sequential 
passes necessary is equal to the maximum path length to any accessible node (why?), 
and on each pass every list node in memory must be examined. However, this method 
requires almost no additional space. 

A somewhat more efficient variation is the following: Suppose that a node n1 in 
the sequential pass has been previously marked and that n1 includes a pointer to an 
unmarked node, n2. Then node n2 is marked and the sequential pass would ordinarily 
continue with the node that follows n1 sequentially in memory. However, if the address 
of n2 is less than the address of 71, the sequential pass resumes from 2 rather than 
from n1. Under this modified technique, when the last node in memory is reached, all 
accessible nodes have been marked. | i 

Let us present this method as an algorithm. Assume that all list nodes in memory 
are viewed as a sequential array. 


#define NUMNODES ...; 
struct nodetype { 
int mark; 
int utype; 
union { 
int intgrinfo; 
char charinfo; 
int Istinfo; 
} info; 
int next; 
} node [NUMNODES] ; 


An array node is used to convey the notion that we can step through all nodes se- 
quentially. node[0] is used to represent a dummy node. We assume that node[0].lstinfo 
and node[0).next are initialized to 0, node[O].mark to true, and node[0].urtype to Ist, and 
that these values are never changed throughout the system’s execution. The mark fiela 
in each node is initially false and is set to true by the marking algorithm when a node 
is found to be accessible. 

Now that we have defined the format of our nodes, we turn to the actual algorithm. 
Assume that acc is an array containing external pointers to immediately accessible 
nodes, declared by 


#define NUMACC = ...; 
int acc([NUMACC]; 


The marking algorithm is as follows: 
/* mark all immediately accessible nodes */ 
for (i = 0; i < NUMACC; i++) 
node[acc[i]].mark = TRUE; 
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/* begin a sequential pass through the array of nodes */ 
/* 7 points to the node currently being examined i f 
j=1; 
while (i < NUMNODES) { 
j= i+1; /* j points to the. node to be examined next */ 
if (node{i}.mark) { 
/* mark nodes to which i points */ 
if (node{i].utype == LST & 
node{node[i].7stinfo).mark != TRUE) { 
/* the information portion of 7 */ 
/* points to an unmarked node */ 
node([node{i}.1stinfo].mark = TRUE; 
if (node[i].1stinfo < j) 
j = node[i}. Istinfo; 
} /* end if */ 
if (node[node[i}.next].mark != TRUE) { 
/* the list node following */ 
/* node{i] is an unmarked node */ 
node[node[i].next].mark = TRUE; 
if (node[i].next < j) 
j = node[i}.next; 
} /* end if */ 
} /* end if */ 
is j; 


} /* end while */ 


In the exercises you are asked to trace the execution of this algorithm on a list distributed 
throughout memory such as list9 in Figure 9.1.9. 

Although this method is better than successive sequential passes, it is still ineffi- 
cient. Consider how many nodes must be examined if node[ 1] is immediately accessible - 
and points to node[999], which points to node[2}, and so on. Thus it is usually too slow 
to use in an actual system. 

A more desirable method is one that is not based on traversing memory sequen- 
tially but rather traverses all accessible lists. Thus it examines only those nodes that are 
accessible, rather than all nodes. 

The most obvious way to accomplish this is by use of an auxiliary stack and is 
very similar to depth-first traversal of a graph. As each list is traversed through the next 
fields of its constituent nodes, the utype field of each node is examined. If the utype field 
of a node is Ist, the value of the /stinfo field of that node is placed on the stack. When 
the end of a list or a marked node is reached, the stack is popped and the list headed 
by the node at the top of the stack is traversed. In the algorithm that follows, we again 
assume that node[0].mark = true. 


for (i = 0; i < NUMACC; i++) { 
/* mark the next immediately accessible */ 
/* node and place it on the stack */ 
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node[{acc[i]].mark = TRUE; 
push(stack, acc[i]); 
while (empty(stack) != TRUE) { 
p = pop(stack); 
while (p != 0) { 
if (node[p].utype == LST & 
node[node[p].]stinfo).mark != TRUE) { 
node[node[p].1stinfo).mark = TRUE; 
push(stack, node[p].?!stinfo); 
} /* end if */ 
if (node[node[p).next].mark == TRUE) 
p= 0; 
else { 
p = node[p) .next; 
node[p).mark =. TRUE; 
} /* end if */ 
} /* end while */ 
} /* end while */ 
} /* end for */ 


This algorithm is as efficient as we can hope for in terms of time, since each node 
to be marked is visited only once. However, it has a significant weakness because of 
its dependence on an auxiliary stack. A garbage collection algorithm is called when 
there is no extra space available, so where is the stack to be kept? Since the size of 
the stack is never greater than the depth of the list nesting and lists are rarely nested 
beyond some reasonable limit (such as 100), a specific number of nodes reserved for 
the garbage collection stack would suffice in most cases. However, there i is always the 
possibility that some user would want to nest nodes more deeply. 

One solution is to use a stack limited to some maximum size. If the stack is about 
to overflow, we can revert to the sequential method given in the previous algorithm. We 
ask the reader to work out the details in an exercise. 

Another solution is to use the allocated list nodes themselves as the stack. Clearly, 
we do not want to add an additional field to each list node to hold a pointer to the next 
node on the stack, since the extra space could be better used for other purposes. Thus 
either the /stinfo field or next field of the list nodes must be used to link together the 
stack. But this means that the list structure is temporarily disturbed. Provision must be 
made for the lists to be restored properly. 

In the foregoing algorithm, each list is traversed using the next fields of its nodes, 
and the value of each pointer /stinfo to a list node is pushed onto a stack. When either 
the end of a list or a section of the list which had already been marked is reached, the 
stack is popped and a new list is traversed. Therefore, when a pointer to a node nd is 
popped, there is no need to restore any of the fields within nd. 

However, suppose that the stack is kept as a list, linked by the next fields. Then 
when a node is pushed onto the stack, its next field must be changed to point to the top 
node in the stack. This implies that the field must be restored to its original value when 
the node is popped. But that original value has not been saved anywhere. (It cannot be 
saved on the stack, since there is no extra storage available for it.) 
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A solution to this problem can be described by the following scheme. Let us first 
assume a list with no elements, that are themselves lists. As each node in the list is 
visited, it is pushed onto the stack and its next field is used.to link it onto the stack. 
Since each node preceding the current node on the list is also present on the stack (the 
top of the stack is the last encountered element on the list), the list can be reconstructed 
easily by simply popping the stack and restoring the next fields. 

The situation is only slightly different in the case where one list is an element 
of another. Suppose that nd1 is a node on /ist1, that nd2 is a node on list2, and that 
node[nd1).lstinfo = nd2. That is, nd2 is the first node of list2, where list2 is an element 
of list1. The algorithm has been traversing listl and is now about to begin traversing 
list2. In this case node|[nd}|.next cannot be used as a stack pointer because it is needed 
to link nd1 to the remainder of list}. However, the listinfo field of nd1 can be used to 
link nd1 onto the stack, since it is currently being used to link to nd2. 

In general, when a node nd is pushed onto the stack, either its Istinfo field or its 
next field is used to point to the previous top element. If the next node to be examined is 
pointed to by node[nd]./stinfo, the Istinfo field is used to link nd onto the stack, whereas 
if the pode is pointed to by node[nd}.next, the next field is used to link nd onto the stack. 
The remaining problem is how to determine for a given node on the stack whether the 
Istinfo or next field is used to link the stack. 

If the urype field of a node indicates that the node is a simple node, its next field 
must be in use as a stack pointer. (This is because the node ha. no /stinfo field that must 
be traversed.) However, a node with a utype field of Ist is not so easily handled. Suppose 
that each time the /stinfo field is used to advance to the next node, the utype field in the 
list node is changed from {st to some new code (say stk for stack) that is neither Ist nor 
any of the codes that denote simple elements. Then when a node is popped from the 
stack, if its utype field is not stk, its next field must be restored, and if its tag field is stk, 
its /stinfo field must be restored and the urtype field restored to Ist. 

Figure 9.2.4 illustrates how this stacking mechanism works. Figure 9.2.4a shows 
a list before the marking algorithm begins. The pointer p points te the node currently 
being processed, top points to the stack top, and q is an auxiliary poifiter. The mark field 
is shown as the first field in each node. Figure 9.2.4b shows the same list immediately 
after node 4 has been marked. The path taken to node 4 is through the next fields of nodes 
1, 2, and 3. This path can be retraced in reverse order, beginning at top and following 
along the next fields. Figure 9.2.4c shows the list after node 7 has been marked. The 
path to node 7 from the beginning of the list was from node 1, through node[{1].next 
to node 2, through node[2]./stinfo to node 5, through node[5].next to node 6, and then 
from node[6].next to node 7. The same fields that link together the stack are used to 
restore the list to its original form. Note that the utype field in node 2 is stk rather than 
Ist to indicate that its /stinfo field, not its next field, is being used as a stack pointer. The 
algorithm that incorporates these ideas is known as the Schorr—Waite algorithm, after 
its discoverer... 

Now that we have described the temporary distortions that are made in the list 
structure by the Schorr—Waite algorithm, we present the algorithm itself. We invite the 


reader to trace through the effects of the algorithm on the lists of Figures 9.2.4a and 
9.1.9. 
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for (i = 0; i < NUMACC; i++) { ! 


/* for each immediately accessible node, */ 


/* trace through its list +7 
p = acc[i]; 

/* initialize the stack to empty #/ 
top = 0; 


/* Traverse the list through its next fields, marking */ 
/* each node and placing it on the stack until a marked */ 
if node or the end of the list is reached. */ 
/* Assume node(0].mark = true */ 
while (node(p}.mark != TRUE) { 
node[p).mark = TRUE; i 
/* place node{p] on the stack, saving a pointer */ 
g to the next node * 
q = node[p] .next; 
node[p}.next = top; 
top = p; 
/* advance to the next node */ 
p= qi 
} /* end while */; 
/* at this point trace the way back through the list, */ 
/* popping the stack until a node is reached whose */ 
/* Istinfo field points to an unmarked node, or until */ 
/* the list is empty +/ 
while (top != 0) { 
/* restore Istinfo or next to p and pop the */ 
/* : stack */ 
p = top; 
/* restore the proper field of node[p] t/. 
if (node[p].utype == STK) { 
/* Istinfo was used as the stack */ 
/* Vink, Restore the tag field */ 
node[p}.utype = LST; 
/* pop the stack t) 
top = node[top] .Istinfo; 
/* restore the Istinfo field */ 
node[p}.Istinfo = q; 
q= p; 
} /* end if */ 
else { 
/* next was used as the stack */ 
i Vink. Pop the stack. £j 
top = node[ top] .next; 
/* restore the next field */ 
node{p].next = Q; 


q=p; 
/* check if we must travel down */ 
fF node[p]. Istinfo */ 


if (node[p}.utyp¢ == LST) { 
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/* indicate that Istinfois */ 
/* used as the stack link */ 
node[p].utype = STK; 
/* push node[p] on the stack */ 
node[p].]stinfo = top; 

, top = p; ! s 
/* advance to next node */ 


} /* end while */ 
} /* end for */ 


Although this algorithm is advantageous in terms of space since no auxiliary stack 
is necessary, it is disadvantageous in terms of time because each list must be traversed 
twice: once in pushing each node in the list on the stack and once in popping the stack. 
This can be contrasted with the relatively few nodes that must be stacked when an 
auxiliary stack is available, 

Of course, several methods of garbage collection can be combined into a single 
algorithm. For example, an auxiliary stack of fixed size can be set aside for garbage 
collection and when the stack is about to overflow, the algorithm can switch to the 
Schorr—Waite method. We leave the details as an exercise. 


Collection and Compaction 


Once the memory locations of a given system have been marked appropriately, the 
collection phase may begin. The purpose of this phase is to return to available memory 
all those locations that were previously garbage (not used by any program but unavail- 
able to any user). It is easy to pass through memory sequentially, examine each node in 
turn, and return unmarked nodes to available storage. ` 

For example, given the type definitions and declarations presented above, the 
following algorithm could be used to return the unmarked nodes to an available list 
headed by avail: 


for (p = 0; p < NUMNODES; p++) { 
if (node[p].mark != TRUE) { 
node[p].next = avail; 
avail = p; 
4} /* end if */ 
node[p].mark = FALSE; 
} /*.end for */ - 


After this algorithm has completed, all unused nodes are on the available list, 
and all nodes that are in use by programs have their mark fields turned off (for the 
, next call to garbage collection). Note that this algorithm places nodes on the avail- 
able list in opposite order of their memory location. If itiwere desired to rețurn nodes to 
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available memory in the order of increasing memory location, the for loop above could 
be reversed to read f 


for (p = NUMNODES - 1; p >= 1; p--) 


Although at this point (following the marking and collection phases of the system) 
all nodes that are not in use are on the available list, the memory of the system may not be 
in an optimal state for future use. This is because the interleaving of the occupied nodes 
with available nodes may make much of the memory on the available list unusable. For 
example, memory is often required in blocks (groups of contiguous nodes) rather than - 
as single discrete nodes one at a time. The memory request by a compiler for space in 
which to store an array would require the allocation of such a block. If, for example, 
all the odd locations in memory were occupied and all the even locations were on the 
available list, a request for even an array of size 2 could not be honored, despite the 
fact that half of memory is on the available list. Although this example is probably not 
very realistic, there certainly exist situations in which a request for a contiguous block 
of memory could not be honored, despite the fact that sufficient memory does indeed 
exist. , 

There are several approaches to this problem. Some methods allocate and free 
portions of memory in blocks (groups of contiguous nodes) rather than in units of in- 
dividual nodes. This guarantees that when a block of storage is freed (returned to the 
available pool), a block will be available for subsequent allocation: requests. The size of . 
these blocks and the manner in which they are stored, allocated and freed are discussed 
in the next section. 

However, even if storage is maintained as units of individual nodes rather than 
as blocks, it is still possible to provide the user with blocks of contiguous storage. The 
Process of moving all used (marked) nodes to one end of memory and all the available 
memory to the other end is called compaction, and an algorithm that performs such a 
process is called a compaction (or compacting) algorithm. 

The basic problem in developing an algorithm that moves portions of memory 
from one location to another is to preserve the integrity of pointer values to the nodes 
being moved. For example, if node( p) in memory contains a pointer q, when node(p) 
and node(q) are moved, not only must the addresses of node( p) and node(q) be modified 
but the contents of node(p) (which contained the pointer q) must be modified to point to 
the new address of node(q). In addition to being able to change addresses of nodes, we 
must have a method of determining whether the contents of any`node contain a pointer 
to some other node (in which case its value may have to be changed) or whether it 
contains some other data type (so that no change is necessary). 

A number of compaction techniques have been developed. As in the case of mark- 
ing algorithms, because the process is required at precisely the time that little additional 
space is available, methods that require substantial additional Storage (for example, a 
Stack) are not practical. Let us examine one compaction algorithm that does not need 
additional memory when it runs. j 

The compaction algorithm is executed after the marking phase and traverses 
memory sequentially. .Each marked node, as it is encountered in the sequential traver- 
sal, is assigned to the next available memory location starting from the beginning of 
available memory. When examining a marked node nd1 that points to a node nd2, the 
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pointer in ndl that now points to nd2 must be updated to the new location where nd2 
will be moved. That location may not yet be known because nd2 might be at a later 
address than nd1. nd is therefore placed on a list emanating from nd2 of all nodes that 
contain pointers to nd2, so that when the new location of nd2 is determined, nd1 can be 
accessed and the pointer to nd2 contained in it modified. 

For now let us assume that a new field header in each node jd2 points to the list 
of nodes that contain pointers to nd2. We call this list the adjustment list of nd2. We 
can reuse the field that pointed to nd2 (either next or Istinfo) as the link field for the 
adjustment list of nd2; we know that its “real” value is nd2 because the node is on the 
list emanating from header(nd2). Thus once its adjustment list has been formed, when 
nd2 is reached in a sequential traversal, that adjustment list can be traversed and the 
values in the fields used to link that list can be changed to the new location assigned to 
nd2. Then, once all nodes that point to nd2 have had their pointers adjusted, nd2 itself 
can be moved, 

However, there is one additional piece of information that is required. The adjust- 
ment list of nodes pointing to nd2 can be linked via either-the next pointer of a node 
nd] (if next(nd1) = nd2) or the lstinfo pointer (if Istinfo(nd1) = nd2). How can we tell 
which it is? For this purpose, three additional fields in each node are necessary. The 
values of these fields can be either “N” for none, which indicates that a node is not on 
an adjustment list, “I” for info, which indicates that a node is linked onto the adjustment 
list using /stinfo, or “L” for link, which indicates that it is linked onto the adjustment | 
list using next. The three fields are named headptr, infoptr, and nextptr. headptr(nd) 
defines the link field in the node pointed to by header(nd); infoptr(nd) defines the link 
field in the node pointed to by /stinfo(nd); and nextptr(nd) defines the link field in the 
node pointed to by next(nd). 

Thus, we assume the following format for the nodes: 


struct nodetype { 
int mark; 
int header; 
‘int next; 
char headptr; 
char infoptr; 
char nextptr; 
int utype; 
union { 
int intgrinfo; 
char charinfo; 
int Istinfo; 
} info; 
}; 
Now consider a single. sequential pass of the algorithm. If a node ndl points 
to a node nd2 that appears later in memory, by the time the algorithm reaches nd2 
sequentially, ndl will have already. been placed on the adjustment list of #7d2. When 
the algorithm reaches nd2, therefore, the pointers in nd! can be modified. But if nd2 
appears earlier in memory, when nd? is reached, it is not yet known that ndl points to 
it; therefore the pointer in nd1 cannot be adjusted. For this reason the algorithm requires 
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two sequential passes. The first places nodes on adjustment lists and modifies pointers 
in nodes that it finds on adjustment lists. The second clears away adjustment lists re- 
maining from the first pass and actually moves the nodes to their new locations. The 
first pass may be outlined as follows: 


1. Update the memory location to be assigned to the next marked node, nd. 

2. Traverse the list of nodes pointed to by header(nd) and change the appropriate 
pointer fields to point to the new location of nd. 

3. If the utype field of nd is Ist and Istinfo(nd) is not null, place nd on the list of the 
nodes headed by header(Istinfo(nd)). 


4. If next(nd) is not null, place nd on the list of the nodes headed by header(next)(nd). 


Once this process has been completed for each marked node, a second pass through 
memory will perform the actual compaction. During the second pass we perform the” 
following operations: 


1. Update the memory location to be assigned to the next marked node, nd. 


2. Traverse the list of nodes pointed to by header(nd) and change the appropriate 
pointer fields to point to the new location of nd. 


3. Move nd to its new location. 


The following algorithm performs the actual compaction. (We assume an auxil- 
iary variable source, which will contain an “N”, “I”, or “L” as explained before for use 
in traversing the lists.) 


/* initialize fields for compaction algorithm */ 
for (i = 1; i < MAXNODES; i++) { 

node[i]. header = 0; 

node[i].headptr = 'N'; 

node{i].infoptr = 'N'; 

node[i].nextptr = 'N'; 
} /* end for */ 


/* . . Pass 1 */ 
/* Scan nodes sequentially. As each node nd is encountered */ 
/* perform the following operations: | 
/* 1. Determine the new location of the node */ 


/* 2. For all nodes that were previously encountered on */ 
/* this pass that point to nd, adjust the appropriate */ 
/* pointer to point to the new location of nd. t/ 
/* 3. If any of the fields of nd point to some other node, */ 
/* . p, place nd on the list headed by node[p]. header t/ 


newloc = 0; 

for (nd = 1; nd < MAXNODES; nd++) 
if (node[nd].mark == TRUE) { 
/* nodes that are not marked are */ 
f* to be ignored. */ 
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newloc++; /* operation 1 A a 
/* operation 2 */ 
p = node[nd)}.. header; 
à source = node[nd]) .headptr; 
while (p != 0) 
/* traverse the list of nodes */ 
/* encountered thus far that */ 
/* point to nd */ 
if (source == 'I') { . 
q = node[p].Istinfo; - 
source = node[p).infoptr; 
node[p}.Istinfo = newloc; 
node[p).infoptr = 'N'; 
p=q 
} /* end if*/ 
else { 
q = node[p].next; 
source = node[p]).nextptr; 
node[p).next = newloc; 
— nextptr = 'N'; 
a Be! 
} i a else */ 
node[nd) .headptr = 'N'; 
node(nd) .header = 0; 
/* operation 3 */ 
if ((node[nd] .utype == LST) && 
(node(nd).Istinfo != 0) { 
/* place node[nd] on a list */ 
/* Vinked by node[nd). Istinfo */ 
p = node[nd).Istinfo; 
node[nd].Istinfo = node[p).header; 
node[nd].infoptr = node(p).headptr; 
node[p) header = nd; 
node[p}.headptr = 'I'; 
} /* end if */ 
/* place node[nd] on a list linked by */ S 
/* node[nd) .next 7 


p = node[nd) .next; 
node[nd] .next = node[p].header; 
node[nd].nextptr = node[p) .headptr; 
if (p != 0) { 
node[p].header = nd; 
node[p].headptr = 'L'; 
} /* end if */ 
} /* end if node[nd).mark */ 
/* Pass 2: This pass examines each node nd in turn, */ 


/* updates all nodes on the adjustment list of and */ 
/* then moves the contents of nd to its new location.  ¢/ 
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newloc = 0; =; 
for (nd = 1; nd < MAXNODES; nd++) 
if (node[nd].mark) { / 
newloc++; , 
p = node[nd) . header; 
source = node[nd].headptr; 
while (p != 0) 
if (source == 'I') { 
q = node[p). Istinfo; 
source = node[p].infoptr; 
node[p].Istinfo = newloc; `> 
node[p].infoptr = 'N'; 
p=q 
} 
else { ` 
q = node[p).next; 
source = node[p].nextptr; 
node[p}.next = newloc; 
. node[p] .nextptr = 'N'; 
pP=q 
} /* end if */ 
node[nd] .headptr = 'N'; 
node[..d].header = 0; 
node[nd].mark = false; 
node[newloc] = node[nd]; 
} /* end if Hode[nd].mark */ 


Several points should be noted with respect to this algorithm. First, node[0} is 
suitably initialized so that the algorithm need not test for special cases. Second, the 
process of adjusting the pointers of all nodes on the list headed by the header field of 
a particular node is performed twice: once during the first pass and once during the 
second. This process could not be deferred entirely to the second pass, when all the 
pointers to a particular node are known. The reason for this is that when a field in a 
node nd2 in the adjustment Jist of node nd is changed to nd, it must be changed before 
nd2 is moved to a new location, since no record of the new location is maintained in 
nd2. Thus nodes on the adjustment list of nd that precede nd sequentially must have 
their fields modified before they have been moved. But since they are moved before we 
reach nd in the second pass, and they have already been placed on the adjustment list 
by the time we reach nd in the first pass, we must clear the adjustment lists'and modify 
the pointer fields at that point. We also must modify pointer fields in the second Pass 
for nodes on the adjustment list of nd that are sequentially after nd and were put on the 
adjustment list of nd during the first pass after having already passed nd. 

` The algorithm seems to require several additional fields for each node. In reality. 
these additional fields are not required. Most systems have at least one field in each 
node that cannot take on a pointer value during the ordinary course of processing. This 
field can be used to hold the header pointer to the adjustment list, so that an additional 
header field is not necessary. The value that was held in this field can be moved to the 
last node in the adjustment list, and placed in either the next or Istinfo field, depending 
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on which of the two held the pointer to the target node. We assume that it is possible 
to distinguish between a Pointer and a nonpointer value so that we can detect when we 
reach the end of the adjustment list by the presence of a nonpointer value in the last 
node. 


In addition, we used the fields headptr, nextptr, and infoptr to indicate which field 


and we can eliminate headptr, nexiptr, and infoptr from the nodes, ; 

We therefore see that our compaction algorithm can be modified so that it does 
not require ány additional storage in the nodes. Such an algorithm is called a bounded 
workspace alyorithm. 

The time requirements of the algorithm are easy to analyze. There are two linear 
passes throughout the complete array of memory. Each pass through memory scans each 
node once and adjusts any pointer fields to which the nodes point. The time requirements 


be sufficiently fragmented to warrant a call to the compaction routine, so that only after 

several calls to the collection routine is it necessary to cali the compaction algorithm. 
On the other hand, if the Compaction routine is not called sufficiently often, the 

system may indicate that insufficient Space is available when in fact there is sufficient 


Variations of Garbage Collection 


There are a number of recently discovered variations of the garbage collection 
systems just presented. In the traditional schemes we have considered, the applications 


size of contiguous memory locations available, the amount of memory requested since 
the last garbage collection phase. and so forth.) When these criteria are no longer met. all 
applications programs halt and the System directs its resources to garbage collection. 
Once the collection has completed, the applications Programs may resume execution 


In some situations, however, this is Not satisfactory. Applications that are execut- 
ing in real time (for example, computing the trajectory of a Spaceship, or monitoring 


ga 
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chemical reaction) cannot be halted while the system is performing garbage collection. 
In these circumstances it is usually necessary to dedicate a separate processor devoted 
exclusively to the job of garbage collection. When the system signals that garbage col- 
lection must be performed, the separate processor begins executing concurrently with 
the applications program. Because of this simultaneous execution, it is necessary to 
guarantee that nodes that are in the process of being acquired for use by an application 
program are not mistakenly returned to the available pool by the collector. Avoiding 
such problems is not a trivial process. Systems that allow the collection process to pro- 
ceed simultaneously with the applications program use “on-the-fly” garbage collection. 

Another subject of interest deals with minimizing the cost of reclaiming unused 
space. In the methods we have discussed, the cost-of reclaiming any portion of storage is 
the same as the cost of reclaiming any other'portion (of the same size). Recent attention 
has been directed toward designing a system in which the cost of reclaiming a portion 
of storage is proportional to its lifetime. It has been shown empirically that some por- 
tions of memory are required for smaller time intervals than are others and that requests 
for portions of memory with smaller lifetimes occur more frequently than do requests 
for portions of memory with longer lifetimes. Thus, by reducing the cost of retrieving 
portions of memory required for short time periods at the expense of the cost of retriev- 
ing portions of memory with longer lifespans, the overall cost of the garbage collection 
process will be reduced. Exactly how one classifies the lifetimes of portions of memory 
and algorithms for retrieving such portions of memory will not be considered further. 
The interested reader is referred to the references. 

The process of garbage collection is also applied to reclaiming unused space in 
secondary devices (for example, a disk). Although the concept of allocation and free- 
ing space is the-same (that is, space may be requested or released by a program), algo- 
rithms that manage Space on such devices often cannot be translated efficiently from 
their counterparts that manipulate main memory. The reason for this is that the cost 
of accessing any location in main memory is the same as that of accessing any other 
location in main memory. In secondary storage, on the other hand, the cost depends on 
the location of storage that is currently being accessed as well as the location we desire 
to access. It is very efficient to access a portion of secondary storage that is in the same 
block that is now being accessed; to access a location in-a different block may involve 
expensive disk seeks. For this reason, device management systems for offline storage 
try to minimize the number of such accesses. The interested reader is referred to the 
Jiterature for a discussion of the relevant techniques. 


EXERCISES 


9.2.1. Implement each of the following list operations of Section 9.1 in C assuming that the 
reference count method of list management is used. 
(a) head 
(b) rail 
(c) addon 
(d) sethead ` 
(e) settail 
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9.2.2. 


9.2.3. 


9.2.4. 


9.2.5. 


9.2.6. 


9.2.7. 


9.2.8. 


9.2.9, 


9.2.10. 


9.2.11. 


9.2.12. 


Rewrite the routines of the previous exercise under the system in which the reference 
counter in a node ndI is decremented when a node nd2 pointing to nd1 is reallocated, 
rather than when nd2 is freed. 

Implement the list operations of Exercise 9.2.1 in C assuming the use of list headers, 
with reference counts in header nodes only. Ensure that illegal Jists are never formed. 
Write an algorithm to detect recursion in a list, that is, whether or not there is a path 
from some node on the list back to itself. f 

Write an algorithm to restore all nodes on a list list to the available list. Do the same 
using no additional storage. 

In a multiuser environment where several users are running concurrently, it may be 
possible for oné user to request additional storage and thus invoke the garbage collector 
while another user is in the middle of list manipulation. If garbage collection is allowed 
to proceed at that point (before the lists of the second user have been restored to legal 
form), the second user will find that many list nodes have been freed. Assume that there 
exist two system routines nogarbage and okgarbage. A call to the first inhibits the 
invocation of garbage collection until after the same user calls the second. Implement 


_ the list operations of Exercise 9.2.1, using calls to these two routines to ensure that 


garbage collection is not invoked at inopportune moments. 
Trace the actions of the three garbage collection algorithms of the text on the lists of 
Figures 9.2.4a and 9.1.9, assuming that the integer above each list node is the index 


of that node in the array node. Trace through the algorithms on the list of Figure 9.1.9, 
after executing the statement 


list9 = NULL; 
Given pointers p and q to two list nodes, write an algorithm to determine whether 


node(q) is accessible from node( p). 


Assume that each node contains an arbitrary number of pointers to other nodes rather 
than just two so that the lists now become graphs. Revise each of the marking algo- 
rithms presented in this section under this possibility. 

Revise each of the marking algorithms presented in this section under the assumption 


. that the lists are doubly linked, so that each list node contains a prevptr field to the 


previous node on the same list. How do each of the algorithms increase in efficiency? 
What restriction on the list structure does the presence of such a field imply? 

Write two marking algorithms that use a finite, auxiliary stack of size stksize. The algo- 
rithms operate like the second marking algorithm presented in the text until the stack 
becomes full. At that point, the first of the two algorithms operates like the sequential 
algorithm presented in the text and the second operates like the Schorr—Waite algo- 
rithm. 

Can you rewrite the Schorr—Waite algorithm to eliminate the goto statement? 


9.3 DYNAMIC MEMORY MANAGEMENT 


In the previous sections we assumed that storage is allocated and freed one node at a 
time. There are two characteristics of nodes that make the previous methods suitable. 
The first is that each node of a given type is of fixed size and the second is that the size 
of each node is fairly small. In some applications, however, these characteristics do not 
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apply. For example, a particular program might require a large amount of contiguous 
storage (for example, a large array). It would be impractical to attempt to obtain such 
a block one node at a time. Similarly, a program may require storage blocks in a large 
variety of sizes. In such cases a memory management system must be able to process 
requests for variable-length blocks. In this section we discuss some systems of this type. 

As an example of this situation, consider a small memory of 1024 words. Suppose 
a request is made for three blocks of storage of 348, 110 and 212 words, respectively. 
Let us further suppose that :hese blocks are allocated sequentially, as shown in Figure 
9.3.1a. Now suppose that the second block of size 110 is freed, resulting in the situation 
depicted in Figure 9.3.1b. There are now 464 words of free space; yet, because the free 
space is divided into noncontiguous blocks, a request for a block of 400 words could 
not be satisfied. f 

Suppose that block 3 were now freed. Clearly, it is not desirable to retain three 
free blocks of 110, 212, and 354 words. Rather the blocks should be combined into a 
single large block of 676 words so that further large requests can be satisfied. After 
combination, memory will appear as in Figure 9.3.1c. 

This example illustrates the necessity to keep track of available space, to allocate 
portions of that space when allocation requests are presented. and to combine contiguous 
free spaces when a block is freed. 


Compaction of Blocks of Storage 


One scheme sometimes used involves compaction of storage as follows: Initially 
memory is one large block of available storage. As requests for storage arrive, blocks 
of memory are allocated sequentially starting from the first location in memory. This is 
illustrated in Figure 9.3.2a. A variable freepoint contains the address of the first location 
following the last block allocated. In Figure 9.3.2a, freepoint equals 950. Note that all - 
memory locations between freepoint and the highest address in memory are free. When 
a block is freed, freepoint remains unchanged and no combinations of free spaces take 
place. When a block of size n is allocated, freepoint is increased by n. This continues 
until a block of size n is requested and freepoint + n — 1 is larger than the highest 
address in memory. The request cannot be satisfied without further action being taken. 

At that point user routines come to a halt and a system compaction routine is 
called. Although the algorithm of the previous section was designed to address uniform 
nodes, it could be modified to compact memory consisting of blocks of storage as well. 
Such a routine copies all allocated blocks into sequential memory locations starting 
from the lowest address in memory. Thus all free blocks that were interspersed with 
allocated blocks are eliminated, and freepoint is reset to the sum of the sizes of all the 
allocated blocks. One large free block is created at the upper end of memory and the user 
request may be filled if there is sufficient storage available. This process is illustrated 
in Figure 9.3.2 on a memory of 1024 words. 

When allocated blocks are copied into lower portions of memory, special care 
must be taken so that pointer values remain correct. For example, the contents of mem- 
ory location 420 in allocated block 2 of Figure 9.3.2a might contain the address 340. 
After block 2 is moved to locations 125 through 299. location 140 contains the previous 
contents of location 340. In moving the contents of 420 to 220, those contents must be 
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changed to 140. Thus in order for compaction to be successful, there must be a method 
to determine if the contents of a given location is an address. 

An alternative is a system that computes addresses as offsets from some base 
address. In that case only the contents of the base address must be changed, and the 
offset in memory need not be altéred. For example, in the previous instance, location 
420 would contain the offset 15 before compaction, rather than the address 340. Since 
the base address of the block is 325, the address 340 would be computed as the base 

. address 325 plus the offset 15. When the block is moved, its base address is changed 
to 125, while the offset 15 is moved from location 420 to location 220. Adding the 
new base address 125 to the offset 15 yields 140, which is the address to which the 
contents of 340 have been moved. Note that the offset 15 contained in memory-has not 
been changed at all. However, such a technique is useful only for intrablock memory 
references; interblock references to locations in a different block must still be modified. 
A compaction routine requires a method by which the size df a block and its status 
(allocated or free) could be determined. 6 

Compaction is similar to garbage collection in that all user processing must stop 
as the system takes time to clean up its storag~ For this reason, and because of the 
pointer problem discussed in the foregoing, compaction is not used as frequently as the 
more complicated schemes that follow. 


First Fit, Best Fit, end Worst Fit 


If it is not desirable to move blocks of allocated storage from one area of memory 
to another, it must be possible to reallocate memory blocks that have been freed dynam- 
ically as user processing continues. For example, if memory is fragmented as shown 
in Figure 9.3.1b and a request is made for a block of 250 words of storage, locations 
670 through 919 would be used. The result is shown in Figure 9.3.3a. If memory is as 
shown in Figure 9.3.1b, a request for a bléck of 50 words could be satisfied by either 
words 348 through 397 or words 670 through 719 (see Figure 9.3.3b and c). In each 
case part'of a free block becomes allocated, leaving the remaining portion free. 

Each time that a request is made for storage, a free area large enough to accom- 
modate the size requested must be located. The most obvious method for keeping track 
of the free blocks is to use a linear linked list. Each free block contains a field contain- 
ing the size of the block and a field containing a pointer to the next free block. These 
fields are in some uniform location (say, the first two words) in the block. If p is the 
address of a free block, the expressions size( p) and next( p) are used to refer to these, 
two quantities. A global pointer freeblock points to the first free block on this list. Let 
us see how blocks are removed from the free list when storage is requested. We then 
examine how blocks are added onto this list when they are freed. 

Consider the situation of Figure 9.3.1b, reproduced in Figure 9.3.4a to show the 
free list. There are several methods of selecting the free block to use when requesting 
storage. In the first-fit method, the free list is traversed sequentially to find the first 
free block whose size is larger than or equal to the amount requested. Once the block 
is found, it is removed from the list (if it is equal in size to the amount requested) or 
is split into two portions (if it is greater than the amount requested). The first of these 
portions remains on the list and the second is allocated. The reason for allocating the 
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second portion rather than the first is that the free list next pointer is at the 
of cach free block. By leaving the first portion ofthe block on the free list, this poime 
need not be copied into some other location, and the next field of the previous block in 


The following first-fit allocation algorithm returns the address of a free block of 
se orage of size n in the variable alloc if one is available and sets alloc to the null address 


p = freeblock; 
alloc = nul); 
q = null; 
while {p != nul] & size(p) < n) { 
q= p; 
p = next(p); 
} /* end while */ 
' if (p != null) { /* there is a block large enough */ 
S = size(p); 
alloc = p+s-n; /* alloc contains the address */ 
/* of the desired block */ 
if (s == n) : 


/* remove the block from the free list */ 
if (q == nul?) 
freeblock = next(p); 
else 
next(q) = next(p); 
else /* adjust the size of the remaining */ 
£ free block */ 
size(p) = s - n; ' 
} /* end if */ 


The best-fit method obtains the smallest free block whose size is greater than or 
equal to n. An algorithm to obtain such a block by traversing the entire free list follows. 
We assume that memsize is the total numbér of words in memory. 


p = freeblock; /* p is used to traverse the free list */ 
q = null; /* q is one block behind p sf 
r = null; /* r points to the desired block ,. af 
rq = null; /* rq is one block behind r */ 


rsize = memsize + 1; /* rsize is the size of the block at r + 
alloc = null; /* allot will point to the block selected */ 
while (p != null) { 
vif (size(p) >= n && size(p) < rsize) { 
/* we have found a free block Closer in size */ 
r= pj 
rg = q; 
rsize’= size(p); 
} /* end if */ A 
, /* continue traversing the free list */ 
G= Dp; 
p = next(p); A ! 
} /* end while */ 
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if (r t= null) { 
/* there is a block of sufficient size */ 
alloc = r + rsize - n; 
if (rsize == n) 
/*remove the block from the free list*/ 
if (rq == null) 
freeblock = next(r); 
else 
next(rq) = next(r); 
else 
size(r) = rsize - n; 
} /* end if */ 


To see the difference between the first-fit and best-fit methods, consider the fol- 
lowing examples. We begin with memory fragmented as in Figure 9.3.4a. There are 
two blocks of free storage, of sizes 110 and 354. If a request is made for a block of 300 
words, the block of 354 is split as shown in Figure 9.3.4b under both the first-fit and 
best-fit methods. Suppose a block of size 25 is then requested. Under first-fit, the block 
of size, 110 is split (Figure 9.3.4c), whereas under best-fit the block of size 54 is split 
(Figure 9.3.4d). If a block of size 100 is then requested, the request can be fulfilled 
under best-fit, since the block of size 110 is available (Figure 9.3.4e), but it cannot be 
fulfilled under first-fit. This illustrates an advantage of the best-fit method in that very 
large free biocks remain unsplit so that requests for large blocks can be satisfied. In 
the first-fit method, a very large block of free storage at the beginning of the free list 

is nibbled away by small requests so that it is severely shrunken by the time a large 
request arrives. 

However, it is also possible for the first-fit method to succeed where the best-fit 
method fails. As an example, consider the case in which the system begins with free 
blocks of size 110 and 54 and then makes successive requests for 25, 70, and 50 words. 
Figure 9.3.5 illustrates that the first-fit method succeeds in fulfilling these requests, 
whereas the best-fit method does not. The reason is that remaining unallocated portions 
of blocks are smaller under best-fit than under first-fit. 

Yet another method of allocating blocks of storage is the worst-fit method. In this 
method the system always allocates a portion of the largest free block in memory. The 
philosophy behind this method is that by using a small number of very large blocks 
repeatedly to satisfy the majority of requests, many moderately sized blocks will be left 
unfragmented. Thus, this method is likely to satisfy a larger number of requests than the 


Blocks remaining using 


cannot be 
fulfilled Figure 9.3.5 
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other methods, unless most of the requests are for very large portions of memory. For 
example, if memory consists initially of blocks of sizes 200, 300, and 100, the sequence 
of requests 150, 100, 125, 100, 100 can be satisfied by the worst-fit method but not by 
either the first-fit or best-fit methods. (Convince yourself that this is the case.) 

The major reason for choosing one method over the other is efficiency. In each of 
the methods, the search can be made more efficient, For example, a true first-fit method, 
which allocates the block at the lowest memory address first, will be most efficient if the 
available list is maintained in the order of increasing memory address (as it should be for 
reasons to be discussed shortly). On the other hand, if the available list is maintained 
in the order of increasing size, a best-fit search for.a block becomes more efficient. 
And finally, if the list is maintained in the order of decreasing size, a worst-fit request 
requires no searching, as the largest size block is always the first on the list. However, 
for reasons we shall discuss shortly, it is not practical to maintain the list of available 
blocks ordered by size. 

Each of the methods has certain characteristics that make it either desirable or 
undesirable for various request patterns. Inthe absence of any specific consideration to 
the contrary, the first-fit method is usually preferred. 


Improvements in the First-Fit Method 


There are several improvements that can be made in the first-fit method. If the 
size of a free block is only slightly larger than the size of the block to be allocated, 
the portion of the free block that remains free is very small. Very often this remaining 
portion is so small that there is little likelihood of its being used before the allocated 
portion is freed and the two portions are recombined. Thus there is little benefit achieved 
by leaving that small portion on the free list. Also recall that any free block must be of 
some minimum size (in our case, two words) so that it may contain size and next fields. 
What if the smaller portion of a free block is below this minimum size after the larger 
portion has been allocated? A 
, The solution to these problems is to insist that no block may remain free if its 
size is below some reasonable minimum. If a free biock is about to be split and the 
remaining portion is below this minimum size; the block is not split. Instead, the entire 
free block is allocated as though it were exactly the right size. This allows the system 
"to remove the entire block from the free list and does not clutter up the list with very 
small blocks. 7 

The phenomenon in which there are many small noncontiguous free blocks is 
called external fragmentation because free space is wasted outside allocated blocks. 
This contrasts with internal fragmentation, in which free space is wasted within al- 
located blocks. The foregoing solution transforms external fragmentation into internal 
fragmentation. The choice of what minimum size to use depends on the pattern of allo- 
cation requests in the particular system. It is reasonable to use a minimum size such that 
only a small percentage (say 5 percent) of the allocation requests are less than or equal 
to that size. Note that the possibility of small slivers remaining is even greater under the 
best-fit method than under first-fit, go that the establishment of such a minimum size is 
of correspondingly greater importance under that method. 

Another significant improvement in the first-fit method can be made. As time 
goes on, smaller free blocks will tend to accumulate near‘the front of the free list. This is 


“ 
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because a large block near the front of the list is reduced in size before a i block 
near the back of the list. Thus, in searching for a large or even a moderate-size block, 
the small blocks near the-front cannot be used. The algorithm would be more efficient 
if the free list were organized as a circular list whose first element varies dynamically 
as blocks are allocated. 

Two ways of implementing this dynamic variance suggest themselves. In the first, 
freeblock (which is the pointer to the first free block on the list) is set to next(freeblock), 
so that the front of the list advances one block each time that a block is allocated. In the 
second, freeblock is set to next(alloc), where alloc points to the block just chosen for 
allocation. Thus all blocks that were too small for this allocation request are in effect 
moved to the Back of the list. The reader is invited to investigate the advantages and 
disadvantages of both techniques. 


Freeing Storage Blocks 


Thus far nothing has been said about how allocated blocks of storage are freed 
and how they are combined with contiguous free blocks to form larger blocks of free 
storage: Specifically, three questions arise: 


1. When a block of storage is freed, where is it placed on the free list? The answer 
to this question determines how the free list is ordered. 

2. When a block of storage is freed, how can it be determined whether the blocks of 
storage on either side of it are free (in which case the newly freed block should 
be combined with an already existing free block)? 


3. What is the mechanism for combining a newly freed block with a previously free 
contiguous block? 


The term liberation is used for the process of freeing an.allocated block of storage; 
an algorithm to implement this process is called a liberation algorithm. The free list 
should be organized to facilitate efficient allocation and liberation. 

Suppose that the free list is organized arbitrarily, so that when a block is freed, 
it is placed at the front of the list. It may be that the block just freed is adjacent to a 
previously free block. To create a single large free block, the newly freed block should 
be combined with the adjacent free block. There is no way, short of traversing the entire 
‘free list, to determine if such an adjacent free block exists. Thus each liberation would 
involve a traversal of the free list. For this reason it is inefficient to maintain the free 
list this way. 

An alternative is to keep the free list sorted in order of increasing memory locatiqn. 
Then, when a block is freed, the free list is traversed in a search for the first free block 
fb whose starting address is greater than the starting address of the block being freed. 
If a contiguous free block is not found in this search, no such contiguous block exists 
and the newly freed block can be inserted into the free list immediately before fb. If fb 
or the free block immediately preceding fb on the free list is contiguous to the newly 
freed block, it can be combined with that newly freed block. Under this method, the 


entire free list need not be traversed. Instead, only half of the list must be traversed on 
the average. 
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The following liberation algorithm implements this scheme, assuming that the 
free list is linear (not circular) and that freeblock points to the free black with the smal- 
lest address. The algorithm frees a block of size n beginning at address alloc. 


q = null; 
p= freeblock; 
/* p traverses the free list. q remains one step ‘behind p*/ 
while (p != null & p < alloc) { 
q= p; 
p = next(p); 
} /* end while */ 
/* At this point, either q = null.or q < alloc and either */ 
/* p= null or alloc < p. Thus if p and q are not null, = */ 
/* the block must be combined with the blocks beginning at */ 
/* por qor both, or must be inserted in the list between */ 
/* the two blocks. a7 
if (q == null) 
freeblock = alloc; 
else 
if (q + size(q) == alloc) { 
/* combine with previous block */ 
alloc = q 
n = size(q) + n; 


else i ý 
next(q) = alloc; 
if (p != null & alloc + n == p) { 
/* combine with subsequent block */ 
size(alloc) = n + size(p); 
next(alloc) = next(p); 


else { 

_ $ize(alloc) = n; 
next(alloc) = p; 

} /* end if */ 


If the free list is organized as a circular list, the frst-fit allocation algorithm begins 
traversing the list from varying İocations. However, to traverse the list from the lowest 
location during liberation, an additional external pointer, lowblock, to the free block 
with the lowest location is required. Ordinarily, traversal starts at lowblock during lib- 
eration. However, if it is found that freeblock < alloc when the block that starts at alloc 
is about to be freed, traversal starts at freeblock, so that even less search time is used 
during liberation. The reader is urged to implement this variation as an exercise. 


Boundary Tag Method 


Itis desirable to eliminate all searching during liberation to make the process more 
efficient. One method of doing this comes at the expense of keeping extra information 
in all blocks (both free and allocated). 
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A search is necessary during libefation to determine if the newly freed block may 
be combined with some existing free block. There is no way of detecting whether such a 
block exists or which block it is without a search. However, if such a block exists, it must 
immediately precede or succeed the block being freed. The first address of the block 
that follows a block of size n at alloc is alloc + n. Suppose that every block contains 
a field flag that is true if the block is allocated and false if the block is free. Then by 
examining flag(alloc + n), it can be determined whether or not the block immediately 
following the block at alloc is free. sO 

It is more difficult to determine the status of the block immediately preceding 
the block at alloc. The address of the last location of that preceding block is, of course, 
alloc — 1. But there is no way of finding the address of its first location without knowing” 
its size. Suppose, however, that each block contains two flags: fflag and bflag, both of 
which are true if the block is allocated and false otherwise. fflag is at a specific offset 
from the front of the block, and bflag is at a specific negative offset from the back of 
the block. 

Thus, to access fflag, the first location of the block must be known; to access 
bflag, the last location of the block must be known. The status of the block follow- 
ing the block at alloc can be determined from the value of Sflag(alloc + n), and the 
status of the block preceding the block at alloc can be determined from the value of 
bflag(alloc — 1). Then, when a block is to be freed, it can be determined immediately 
whether it must be combined with either of its two neighboring blocks. 

A list of free blocks is still needed for the allocation process. When a block is 
freed, its neighbors are examined. If both blocks are allocated, the block can simply 
be appended to the front of the free list. If one (or both) of its neighbors is free, the 
neighbor(s) can be removed from the free list, combined with the newly freed block, 
and the newly created large block can be placed at the head of the free list. Note that this 
would tend to reduce search times under first-fit allocation as well, since a previously 
allocated block (especially if it has been combined with other blocks) is likely to be 
large enough to satisfy the next allocation request. Since it is placed at the head of the 
free list, the search time is reduced sharply. 

To remove an arbitrary block from the free list (to combine it with a newly freed 
block) without traversing the entire list, the free list must be doubly linked. Thus each. 
free block must contain two pointers, next and prev, to the next and previous free blocks 
on the free list. It is also necessary to be able to access these two pointers from the last 
location of a free block. (This is needed when combining a newly freed block with a 
free block that immediately precedes it in memory.) Thus the front of free block must 
be accessible from its rear. One way to do’this is to introduce a bsize field at a given 
negative offset from the last location of each free block. This field contains the same 
value as the size field at the front of the block. Figure 9.3.6 illustrates the structure of 
free and allocated blocks under this method, which is called the boundary tag method. 
Each of the-control fields, fflag, size, next, prev, bsize, and bflag is shown as occupying 
a complete word, although in practice they may be packed together, several fields to a 
word. { 

We now present the liberation algorithm using the boundary tag method. For clar- 
ity, we assume that fflag and bflag are logical flags and that rrue indicates an allocated 
block and false indicates a free block. [We assume that bfiag(0) and fflag(m), where m 
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Allocated block Figure 9.3.6 


is the size of memory, are both true.] The algorithm frees a block of size n at location 
alloc. It makes use of an auxiliary routine remove that removes a block from the doubly 
linked list. The details of that routine are left as an exercise for the reader. 


/*check the preceding block*/ 
if (bflag(alloc - 1) != TRUE) { 
/* the block must be combined with the preceding block */ 


start = alloc - bsize(alloc - 1); /* find the initial ef 
/* address of the block */ 
remove(start); /* remove the block from the free list */ 


/* increase the size and combine the blocks */ 
n = n + size(start); 
alloc = start; 
} /* end if */ 
/* check the following block */ 
if (fflag(alloc + n) != TRUE) { 
/* the block must be combined with the following block */ 
start = alloc + n; è 
n = n + size(start); 
remove(start); 
} /* end if */ 
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/* add the newly free, possibly combined */ 
f* block to the free list */ 
next(alloc) = freeblock; 

prev(freeblock) = alloc; 

prev(alloc) = null; 

freeblock = alloc; 

/* adjust the fields in the new block */ 
fflag(alloc) = FALSE; 

bflag(alloc + n - 1) = FALSE 

size(alloc) = n; 

bsize(alloc + n - 1) = n; 


Of course, the newly freed bluck can be inserted into the list based on its size, so that 
one of the other methods (for example, best-fit or worst-fit) can also be used. 


Buddy System 

An alternative method of handling the storage management problem without fre- 
quent list traversals is to keep separate free lists for blocks of different sizes. Each list 
contains free blocks of only one specific size. For example, if memory contains 1024 
words it might be divided into fifteen blocks: one block of 256 words, two blocks of 
128 words, four blocks of 64 words, and eight blocks of 32 words. Whenever Storage is 

. Tequested, the smallést-block whose size is greater than or equal to the size needed 
is reserved. For example, a request for a block of 97 words is filled by a block of 
size 128. 

There are several drawbacks to this scheme. First, space is wasted due to internal 
fragmentation. (In the example, 31 words of the block are totally unusable). Second, and 
more serious, a request for a block of size 300 cannot be filled, since the largest size 
maintained is 256. Also, if two blocks of size 150 are needed, the requests cannot be 
filled even if sufficient contiguous space is available. Thus, the solution is impractical. 
The source of the impracticality is that free Spaces are never combined. However, a 
variation of this scheme, called the buddy system, is quite useful. 

Several free lists consisting of various sized blocks are maintained. Adjacent free 
blocks of smaller size may be removed from their lists, combined into free blocks of 
larger size, and placed on the larger size free list. These larger blocks can tHen be used 
intact to satisfy a request for a large amount of memory or they can be split once more 
into their smaller constituent blocks to satisfy several smaller requests. 

The following method works best on binary computers in which the memory size 
is an integral power of 2 and in which multiplication and division by 2 can be per- 
formed very efficiently by shifting. Initially, the entire memory of size 2” is viewed as 
a single free block. For each power of two between 1 (which equals 2°) and 2”, a free 
list containing blocks of that size is maintained. A block of size 2! is called an i-block 
and the free list containing i-blocks is called the i-list. (In practice, it may be unrea- ` 
sonable to keep free blocks of sizes 1, 2, and 4, so that 8 is the smallest free block size * 
allowed; we will ignore the possibility.) However, it may be (and usually is) the case 

that some of these free lists are empty. Indeed, initially all the lists except the m-list 
are empty. 
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. Blocks may be allocated only in sizes 2‘ for some integer k between 0 and 
m. If a request for a block of size n is made, an i-block is reserved where į is the 
smallest integer such that n <= 2!. If no i-block is available (the i-list is empty), 
an (i + 1)-block’is removed from the (i + 1)-list and is split into two equal size 
buddies. Each of these buddies is an i-block. One of the buddies is allocated, and 
the other remains free and is piaced on the i-list. If an (i + 1)-block is also un- 
available, an (i + 2)-block is split into two (i + 1)-block buddies, one of which is 
placed on the (i + 1)-list and the other of which is split into two i-blocks. One of 
these i-blocks is allocated and the other is placed onto the i-list. If no (i + 2)-block 
is free, this process continues until either an i-block has been allocated or an m-block 
is found to'be unavailable. In the former case, the allocation attempt is successful: 
in the latter case a block of proper size is not available. 
The buddy system allocation process can best be described as a recursive function 
getblock(n) that returns the address of the block to be allocated, or the null pointer if no 
block of size n is available. An outline of this function follows: i 


find the smallest integer i such that 27 >= n; 
if (the i-list is not empty) { 
p = the address of the first block on the i-list; 
remove the first block from the i-list: 
return(p); 
} /* end if */ 
else /*the i-list is empty*/ 
if (i == m) 
return(nul1); 
else { Pa 
p = getblock(2*+1) ; 
return(nul!); 
else { 
put the i-block starting at location :p on the i-list; 
return(p + 27); 
} /* end if */ 
} /* end if */ 


In this outline, if an (i + 1)-block starts at location p, the two i-blocks into which 
it is split start at locations p and p + 2!. The first of these remains on the free list, and 
the second is allocated. Each block is created by splitting a block of one size higher. 
If an (i + 1)-block is Split into two i-blocks b1 and b2, bl and b2 are buddies of each 
other. The buddy of an i-block at location P is called the i-buddy of p. Note that a block 
at location p can have several buddies but only one i-buddy. 

If an i-block is freed and its i-buddy is already free, the two buddies are combined 
* into the (i + 1)-block from which they were initially created. In this-way a larger free 

block of storage is created to satisfy large requests. If the i-buddy of a newly treed 
i-block is not free, then the newly freed block is Placed directly on the i-list. 

Suppose that a newly freed i-block has been combined with its previously free 
i-buddy into an (i + 1)-block. It is possible that the (i + 1)-buddy of this recombined 
(i + 1)-block is also free. In that case the two (i + 1)-blocks can be recombined further 
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into an (i + 2)-block. This process continues until a recombined block is created whose 
buddy is not free or until the entire memory is combined into a single m-block. 
The liberation algorithm can be outlined as a recursive routine liberate(alloc, i) 
that frees an i-block at location alloc: 
\ z 
if (i == m) or (the i-buddy of alloc is not free) 
add the i-block at alloc to the i-list 
else { 
remove the i-buddy of alloc from the i-list; 
combine the i-block at alloc with its i-buddy; 
p = the address of the newly formed (7 + 1)-block; 
liberate(p,i + 1); ‘ 
} /* end if */ 


Let us refine the outline of liberate; we leave the refinement of getblock as an exercise 
for the reader. f 

There is one obvious question that must be answered. How can the free status 
of the f-buddy of alloc be established? Indeed, how can it be determined whether an 
i-buddy of alloc exists at all? It is quite possible that the i-buddy of alloc has been 
split and part (or all) of it is allocated. Additionally, how can the starting address of the 
i-buddy of alloc be determined? If the i-block at alloc is the first half of its containing l 
(i + 1)-block, its i-buddy is at alloc + 2'; if the i-block is the second half of its con- 
taining block, its i-buddy is at alloc — 2'. How can we determine which is the case? 

At this point it would be instructive to look at some examples. For illustrative 
purposes, consider an absurdly small memory of 1024 (=2'°) words. Figure 9.3.7a 
illustrates this memory after a request for a block of 100 words has been filled. The 
smallest power of 2 greater than 100 is 128 (= 27). Thus, the entire memory is split 
into two blocks of size 512; the first is placed on the 9-list, and the second is split into 
two blocks of size 256. The first of these is placed on the 8-list and the second is split 
into two blocks of size 128, one of which is placed on the 7-list and the second of which 
is allocated (block B1). At the bottom of the figure, the starting addresses of the blocks 
on each nonempty i-list are indicated. Make sure that you follow the execution of the 
functions getblock and liberate on this and succeeding examples. 

Figure 9.3.7b illustrates the sample memory after filling an additional request for 
50 words. There is no free 6-block; therefore the free 7-block at location 768 is split into 

. two'6-blocks. The first 6-block remains free, and the second is allocated as block B2. In 

Figure 9.3.7c three additional 6-blocks have been allocated in the order B3, B4, and BS. 
When the first request is made, a 6-block at location 768 is free, so that no splitting-is 
necessary. The second request forces the 8-block at 512 to be split into two 7-blocks and 
the second 7-block at 640 to be split into two 6-blocks. The second of these is allocated 
as B4, and when the next request for a 6-block is made, the first is also allocated as B5. 

Note that in Figure 9.3.7a the block beginning at 768 is a 7-block, whereas in 
Figure 9.3.7b it is a 6-block. Similarly, the block at 512 is an 8-block in Figures 9.3.7a 
and b, but a 7-block in Figure 9.3.7c. This illustrates that the size of a block cannot be 
determined from its starting address. However, as we shall soon see, a block of a given 
size can start only at certain addresses. 
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Figure 9.3.7 


Figure 9.3.7d illustrates the ‘situation after blocks B4 and B3 have been freed. 
When block B4 at location 704 is freed, its buddy is examined. Since B4 is a 6-block 
that is the second half of the 7-block from which it was split, its buddy is at location 
704 — 26 = 640. However, the 6-block at location 640 (which is B5) is not free; there- 
fore no combination can take place. When B3 is freed, since it is a 6-block and was the 
first half of its containing 7-block, its 6-buddy at 768 + 2° = 832 must be examined. 
However, that 6-buddy is allocated, so again, no combination can take place. Notice 
that two adjacent blocks of the same size (6-blocks B4 and B3 at 704 and 768) are free 
but are not combined into a single 7-block. This is because they are not buddies; that 
is, they were not originally split from the same 7-block. B4 can be combined only with 
its buddy BS, and B3 can be combined only with its buddy B2. 

In Figure 9.3.7e, the 7-block Bl has been freed. B1 is the second half of its 
8-block; therefore its 7-buddy is at 896 — 27 = 768. Although bleck B3, which starts 
at that location is free, no combination can take piace. This is because olock B3 is 
not a 7-block, but only a 6-block. This means that the 7-block Siarting at 768 is split 
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Figure 9.3.7 (cont.) 


and therefore partially allocated. We see that it is not yet ready for combination. Both the 
address and the size of a given free block must be considered when making a decision 
as to whether or not to combine buddies. x 

_ In Figure 9.3.7f, the 6-block B5 at location 640 is freed. B5 is the first half of its 
containing 7-block; therefore its 6-buddy is at 640 + 26 = 704. That 6-buddy (block 
B4) is already free; therefore the two can be combined into a single 7-block at 640. 
That 7-block is the second half of its containing 8-block; therefore its 7-buddy is at 
640 — 27 = 512. The 7-block at that location is free, so that the two 7-blocks can be 
combined into'an 8-block at 512. That 8-block is the first half of its containing 9-block; 
therefore its-8-buddy is at 512 + 28 = 768. But the block at location 768 is a 6-block 
rather than an 8-block; therefore no further combination can take place. 

_ These examples illustrate that it is necessary to be able to determine whether a 
given i-block is the first or second half of its containing (i + 1)-block in order to compute 
the location of its i-buddy. 
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Clearly, there is only: one m-block in memory, and its starting location is 0. 
When this block is split, it produces two (m — 1)-blocks Starting at location 0 and 277!” 
These split into four (m — 2)-blocks at locations 0, 2™-2, 2m-1 and 3 * 2™-2, In general, 
there are 2”~! i-blocks starting at locations that are integer multiples of 2', For example, 
ifm = 10 (memory size is 1024), there are 2!9-6 = 16 six-blocks starting at locations 
0, 64, 128, 192, 256, 320, 384, 448, 512, 576, 640, 704, 768, 832, 896, and 960. Each 
of these addresses is an integral multiple of 64 (which is 2°), from 0 * 64 to 15 * 64. 

Notice that any address that is the starting location of an i-block is also the start- 
ing location of a k-block for all 0 <= k < i. This is because the i-block can be split 
into two (i — 1)-blocks, the first of which begins at the same location as the i-block. 
This is consistent with the observation that an integral multiple of 2‘ is also an inte- 
gral multiple of 2'~'. However, the reverse is not necessarily true, A location that is 
the starting address of an i-block is the starting address of an (i + 1)-block only if the 
i-block is the first half of the (i + 1)-block, but not if it is the second half. For exam- 
ple, in Figure 9.3.7, addresses 640 and 768 begin 7-blocks as well as 6-blocks; and 
768 begins an 8-block as well. However, addresses 704 and 832 begin 6-blocks but not 
7-blocks. 

After making these observations, it is easy to determine whether a given i-block 
is the first or second half of the (i + 1)-block from which it was split. If the start- 
ing address p of the i-block is evenly divisible by 2‘*', the block is the first half of 
an (i + 1)-block, and its i-buddy is at p + 2'; otherwise it is the second half of an 
(i + 1)-block and its buddy is at p — 2'. 

We can therefore introduce a function buddy(p,i) that returns the address of 
the i-buddy of p [we use an auxiliary function expon(a,b) that computes a’]: 


if (p % expon(2,i + 1) == 0) 
return(p + expon(2,i)); 
else 
return(p - expon(2,i));- E 
Now that the address of a newly freed block’s i-buddy can be found, how can 
we determine whether or not that buddy is free? One way of making that determina- 
tion is to traverse the i-list to see ‘whether a block at the desired address is present. 
If it is, it can be removed and combined with its buddy. If it is not, the newly freed 
i-block can be added to the i-list. Since each i-list is generally quite small [because as 
soon as two i-buddies are free they are combined into an (i + 1)-buddy and removed 
from the i-list], this traversal is fairly efficient. Furthermore, to implement this scheme, 
each i-list need not be doubly linked, since a block is removed from the i-list only 
after the list is traversed, so that its list predecessor is known. j 
An alternative method that avoids list traversal is to have each block contain a 
flag to indicate whether or not it is allocated. Then when an i-block is freed it is possible 
to determine directly whether or not the block beginning at the address of its buddy 
is already free. However, this flag alone is insufficient. For example, in Figure 9.3.7e, 
when 7-block B1 at location 896 is freed, its buddy’s starting address is calculated as 
768. The block at 768 is free and its flag would indicate that fact. Yet the two blocks 
at 768 and 896 cannot be combined because the block: at 768 is not a 7-block but a 
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6-block whose 6-buddy is allocated. Thus an additional power field is necessary in each 
block..The value of this integer field is the base-two logarithm of its size (for example, 
if the block is of size 2', the value of power is i). When an i-block is freed, its buddy's 
address is calculated. If the power field at that address is i and if the flag indicates that 
the buddy is free, the two blocks are combined, 

Under this method, the i-lists are required only for the allocation algorithm so that 
a block of proper size can be found efficiently. However, because blocks are removed 
from the i-lists without traversing them, the lists must be doubly linked. Thus each free 
block must contain four fields: free, power, prev, and next. The last two are pointers to 
the previous and next blocks on the i-list. An allocated block need contain only the flag 
field. a 

We present the second method of liberation, leaving the first to the reader as an 
exercise. We assume an array of pointers list[m+ 1), where list[i] points to the first block 
on the i-list. We also replace the recursive call to liberate by a loop in which succes- 
sively larger blocks are combined with their buddies until a block is formed whose 
buddy is not free. The algorithm liberate(alloc, i) frees an i-block at location alloc. For 
completeness, let us establish that buddy(p, m) equals 0. The flag free is true if the 

block’is free, and false otherwise. 


P = alloc; 
bud = buddy(p, i); i 
while (i < m && free(bud) == TRUE & power(bud) == 7) { 
/* remove i-buddy of p from the i-list */ 
q = prev(bud); 
if (q = mul) 
list[i] = next(bud); 
else f 
next(q) = next(bud); 
if (mext(bud) != nu71) 
prev(next(bud)) = q; 
/* combine the i-block at p with its buddy */ 
if (p % expon(2,7 + 1) != 0) 
/* the combined block begins at bud */ 
p = wud; 
i++; 
bud = buddy(p,i); f 
/* attempt to combine the larger block with its buddy */ - 
F  /* end while */ 
/* add the i-block at p to the i-list */ 
q = list[i); 
prev(p) = null; 
next(p) = q; 
list[i] = p; 
if (q != nul) 
prev(q) = p; 
/* adjust the fields on the i-block */ 
power(p) = 7; . 
Free(p) = TRUE; 
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Other Buddy Systems 


The buddy system that we have just considered is called the binary buddy 
system, based on the rule that when an i-block (of size 2!) is split, two equal-sized 
(i — 1)-blocks are created. Similarly, two i-blocks that are buddies can be joined into a 
single (i + 1)-block. 

There are, however, other buddy systems in which a large block is not nec- 
essarily split into two equal-sized smaller blocks. One such system is called the 
Fibonacci buddy system. The sizes of the blocks in this system are based on the 
Fibonacci numbers first introduced in Section 3.1; Instead of blocks of size 1, 2, 4, 
8, 16, ...as in the case of the binary buddy system, the Fibonacci buddy system 
uses blocks of size 1, 2, 3, 5, 8, 13, ... when an i-block (the size of an i-block in 
this system is the ith Fibonacci number) is split into two blocks, one of the blocks 
is an (i ~ 1)-block, and the other is an (i — 2)-block. Thus, for example, a 9-block 
(of size 34) may split into an 8-block (size 21) and a 7-block (size 13). Similarly, 
the buddy of an i-block may be either an (i + 1)-block or an (i — 1)-block. In the 
former case, recombination produces an (i + 2)-block, and in the latter case, an 
(i + 1)-block is produced. ; 

Another alternative buddy system is the weighted buddy system. In this scheme 
a block of size 2% is split into two blocks, one of size 24-2? and the other of size 3:« 24-2, 
‘For example, a block of size 64 splits into two buddies of sizes 16 and 48. Rules for 
recombination are similar. 

The philosophy behind such schemes, in which blocks are split into unequal 
subblocks, is that requests for storage are usually not for sizes that match those 
of the blocks in the system. „Thus, the next larger size block must be used, with 
the result that space is wasted within the block. For example, in the binary buddy 
system, when a request is made for a block of size 10, a block of size 16 will 
be allocated (resulting in 6 wasted bytes); in the Fibonacci buddy system, how- 
ever, a block of size 13 may be allocated (resulting in only 3 wasted bytes); in the 
weighted buddy system, a block of size 12 will be sufficient (resulting in only 2 wasted 
bytes). It is not always the case that the Fibonacci system results in less wasted space 
than the binary system (for example, a request for a block of size 15), but in general, 
allowing blocks of varying sizes will more likely produce a closer fit than will requiring 
groups of blocks to be of uniform size. 

An alternative to the foregoing approach is to combine smaller blocks into larger 
ones only when necessary. In such a scheme, called a recombination delaying buddy 
system. when a block is freed it is returned to the list of blocks of its size. When a block 
of a particular size is required, the list of blocks of that size is searched. If a block of 
the required size is found, the search halts successfully; otherwise a search is made for 
a pair of blocks of the next smaller size that are buddies. If such a pair exists, the two 
blocks are combined to form a single block of the required size. If no such pair exists, 
this process is repeated recursively with successively smaller blocks until either a block 
of the required size can be formed fiom smaller blocks, so that the search is successful, 
or until it is determined that the required block cannot be formed from smaller blocks. 
Blocks of larger sizes will also be searched to determine if a split is feasible. If a block 
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of the desired size cannot be found either by splitting larger blocks or by combining 
smaller blocks, the search ends in failure. 

The philosophy behind this scheme is that smaller blocks are often returned to the 
available pool only to be called for again. Instead of recombining the smaller blocks into 
a larger block only to decompose it again, the smaller blocks are retained and are recom- 
bined into larger blocks only when blocks of the larger size are necessary. The disadvan- 
tage of this approach is that blocks of larger sizes may not-be available when there is, in 
fact, enough memory to'satisfy their requests. For example, there may be three i-blocks 
available, two of which are buddies. If a request for an.i-block arrives and one of the 
i-buddies is used to satisfy this request, a subsequent request for an (i + 1)-block cannot 
be satisfied. If, on the other hand, the two i-blocks were recombined into an (i + 1)-block 
first, the request for an i-block would be satisfied from the isolated i-block before an at- 
tempt would be made to break up an (i + 1)-block. (Of course, it is possible to place the 
i-block on the i-list in such a way that i-buddies are always at the rear of the list. This 
prevents the allocation of one of a pair of buddies before an isolated block of the required 
size is allocated. However, when it is necessary to allocate one of several pairs of buddies, 
it may be difficult to select the pair that will allow subsequently larger recombinations.) 

Yet another variation of the buddy system is the tailored list buddy system. In 
this system, instead of maintaining the lists in blocks as large as possible (the standard 
system) and instead of not combining buddies until blocks of a larger size are necessary 
(the recombination delaying system), the blocks a are distributed on the various lists in 
preassigned proportions. 

If the relative frequency of requests for blocks of the possible block sizes are 

known, memory may be divided initially into blocks of the different sizes according 
to the given distribution. As blocks are called for and returned to the pool, a record is 
kept of the actual number of blocks of each block size. When a block is returned to the 
pool and the number of blocks of that size is at or greater than the number specified by 
the distribution, an attempt is made to combine the block into a block of the next larger 
size. This précess is repeated successively until no. such recombination is possible or 
. until the number of blocks of each size is not exceeded. 
When a block of a particular size is requested and there is no block of the required 
size, a block may be formed either by splitting a block of larger size or by recombining 
several blocks of smaller size. Various allocation strategies can be used in this case. Very 
often the distribution of requests is not known in advance. In such a case it is possible to 
allow the distribution of blocks to stabilize slowly, by maintaining a record of the actual 
distribution of requests as they arrive. The desired distribution will probably never be 
achieved exactly, but it can be used as a guide ify determining whether and when to 
recombine blocks. f 

There are two primary disadvantages to buddy systems. The first is internal frag- 
mentation. For example, in the binary buddy system, only blocks whose sizes are inte- 
gral powers `of 2 can be allocated without waste. This means that a little less than half 
the storage in each block could be wasted. The other disadvantage is that adjacent free 
blocks are not combined if they are not buddies. However, simulations have shown that 
the buddy system does work well and that once the pattern of memory allocations and 
liberations stabilizes, splitting and combinations take place infrequently. 


Storage Management Chap. 9 


EXERCISES 


9.3.1. 


9.3.2. 


93.3. 


9.3.4. 


9.3.5. 


9.3.6. 


9.3.7. 


9.3.8. 


Let s be the average size of an allocated block in a system that uses compaction. Let r 
be the average number of time units between block allocations. Let m be the memory 
size, f the average percentage of free space, and c the average number of time units 
between calls on the compaction algorithm. If the memory system is in equilibrium 
(over a period of time, equal numbers of blocks are allocated as are freed), derive a 
formula for c in terms of s, r, m, and f. 
Implement the first-fit, best-fit, and worst-fit methods of storage allocation in C as fol- 
lows: Write a function getblock(n) that returns the address of a block of size n that is 
available for allocation and modifies the free list appropriately. The function should 
utilize the following variables: * 

memsize, the number of locations in memory 

memory|memsize), an array of integers representing the memory 

freeblock, a pointer to the first location of the first free block on the list 
The value of size(p) may be obtained by the expression memory|p}. and the value of 
next( p) by the expression memory[p + 1). 
Revise the first-fit and best-fit algorithms so that if a block on the free list is less than 
x units larger than a request, the entire block is allocated as is, unsplit. Revise the C 
implementations of Exercise 9.3.2 in a similar manner. 


Revise the first-fit algorithm and its implementation (see Exercise 9.3.2) so that the 

free list is circular and is modified in each of the following ways. 

(a) The front of the free list is moved up one block after each allocation request. 

(b) The front of the free list is reset to the block following the block that satisfied the 
last allocation request. 

(c) Ifa block is split in meeting an allocation request, the remaining portion of that 
block is placed at the rear of the free list. 

Whatare the advantages and disadvantages of these methods over the method presented 

in the text? Which of the three methods yields the smallest average Search time? Why? 


Design two liberation algorithms in which a newly freed node is placed on the front of 
the free list when no combinations can be made. Do not use any additional fields other 
than next and size. In the first algorithm, when two blocks are combined the combined 
block is moved to the front of the free list; in the second, the combined block remains 
at the same position in the free list as its free portion was before the combination. What 
are the relative merits of the two methods? 

Implement the liberation algorithm presented in the text in which the free list is or- 
dered by increasing memory location. Write a C function liberate(alloc, n) that uses 
the variables presented in Exercise 9.3.2, where alloc is the address of the block to be 
freed and n is its size. The procedure should modify the free list appropriately. 


Implement a storage management system by writing a C program that accepts inputs 
of two types: An allocation request contains an ‘A’, the amount of memory requested, 
and an integer that becomes the idenjifier of the block being allocated (that is, block 
1, block 2, block 3, and so on). A liberation request contains an ‘L’ and the integer 
identifying the block to be liberated. The program should call the routines getblock 
and liberate programmed in Exercises 9.3.2 and 9.3.6. 

Implement the boundary tag method of liberation as a C function, as in Exercises 9.3.2 
and 9.3.6. The values of size( p) and bsize( p) should be obtained by the expression 
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9.3.9, 


9.3.10. 


9.3.11. 


9.3.12. 


9.3.13. 


9.3.14. . 


abs(memory| p)), fflag( p) and bflag( p) by ( memory|p] > 0), next( p) by memory 

[p + 1], and prev(‘p) by memory | p + 2]. 

How could the free list be organized to reduce the search time in the best-fit method? 

What liberation algorithm would be used for such a free list? 

A storage management system is in equilibrium if as many blocks are allocated as are 

liberated in any given time period. Prove the following about a system in equilibrium. 

(a) The fraction of total storage that is allocated is fairly constant. 

(b) If adjacent free blocks are always combined, the number of allocated blocks is 
half the number of free blocks. 

(c) If adjacent free blocks are always combined, and the average size of an allocated 
block is greater than some multiple k of the average size of a free block, the 
fraction of memory that is free is greater than A/(k + 2). 

Present allocation and liberation algorithms for the following systems. 

(a) _ Fibonnaci buddy system 

(b) Weighted buddy system 

(c) Recombination delaying buddy system 

(d) Tailored list buddy system 


Refine the outline of getblock, which is responsible for allocation in the buddy system, 
* into a nonrecursive algorithm that explicitly manipulates free lists. 


Prove formally (using mathematical induction) that in the binary buddy system: 
(a) There are 2”-' possible i-blocks. 


(b) The starting address of an i-block is an integer multiple of 2'. 
_Implement the binary buddy system as a set of C programs. 
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